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5 Lipid phosphate phosphatases and uses thereof for treating neuronal diseases 

The present invention relates to lipid phosphate phosphatase proteins, genes coding for them, 
vectors and cells comprising them, antibodies directed against them, methods of identifying 
compounds binding to them and functional interactors as well as to the use of proteins, genes, 
10 vectors, cells, interacting compounds and functional interactors for treating neuronal diseases 
and/or injuries. 

Axons in the central nervous system (CNS) elongate through the extracellular space over long 
distances (N. Tessier-Lavigne and C. S. Goodman (1996) Science 274:1123-1133). This oc- 

15 curs during development (C. S. Goodman (1996) Annu. Rev. Neurosci.l9:341-377 and H. 
Super and E. Soriano (1994) J. Comp. Neurol. 344:101-120) and during axonal sprouting in 
response to partial deafferentation (C. Cotman et al. (1977) J. Neurocytol. 6:455-464 and M. 
Frotscher et al. (1997) Trends Neurosci. 20:218-223). The extracellular space, however, is an 
outgrowth repellent environment that allows axonal elongation only under specific molecular 

20 conditions (E. Stein and N. Tessier-Lavigne (2001) Science 291:1928-1938). Molecules in- 
volved in axonal outgrowth, such as semaphoring netrins, or ephrins (S. A. Colamarino and 
M. Tessier-Lavigne (1995) Cell 81:621-629, H Kobayashi et al. (1997) J. Neurosci. 17:8339- 
8352, E. Stein et al. (1999) J. Neurosci. 19:8585-8893 and A. Steup et al. (2000) Mol. Cell 
Neurosci. 15:141-155) are able to transduce outgrowth promoting as well as inhibiting signals 

25 to elongating axons via specific receptors. 

In the hippocampus, afferent connections from the entorhinal cortex enter in a layer-specific 
manner during development (T. Skutella and R. Nitsch (2001) Trends Neurosci. 24:107-1 13). 
This specific axonal navigation depends on molecular cues expressed along the pathway and 

30 in the target region (T. Skutella and R. Nitsch, supra) Transection of entorhinal axons in the 
adult leads to a specific deafferentation in the hippocampus with subsequent regenerative 
axon sprouting by remaining afferents into the denervated zones (C. Cotman et al, supra and 
D.A Matthews et al. (1976) Brain Res. 1 15:23-41). It has been shown that signaling via bio- 
active lipid phosphates such as phosphatidate (PA), lysophosphatidate (LP A) or sphingosine- 

35 1 -phosphate (S-l-P) are involved in cell migration, mitogenesis and neurite retraction (K. 
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Jalink et al. (1994) Biochim. Biophys. Acta 1198:185-196, W. H. Moolenaar (1995) Curr. 
Opin. Cell Biol. 7:203-210 and N. Zhang et al (1997) Nature 385:64-67) and in particular it 
has been shown that signaling via extracellular LPA plays an important role in CNS develop- 
ment and that postmitotic neurons are at least one endogenous source for LPA in the nervous 
5 system (N. Fukushima et al (2000) Dev. Biol. 228:6-18). LPA has properties of an extracel- 
lular neurite repellent factor (K. Jalink (1994) supra and K. Jalink et al (1993) Cell Growth 
Differ. 4:247-255). It is present in the extracellular space of the nervous system (Fukushima et 
al. (2000) supra and J. Bothmer et al (1992) Neurochem. Int. 21:223-228) and mediates di- 
verse cellular responses through the activation of multiple signal transduction pathways (W. 
10 H. Moolenaar (1995) supra). One major structural effect of LPA on neurons is rapid neurite 
retraction with subsequent cell rounding. Therefore, LPA and similar bioactive lipid phos- 
phatases inhibit a regrowth of axons following neuronal lesion. Therefore, it is a problem 
known in the art, that after neuronal damage due to, for example, neuronal disease or trauma a 
regrowth of axons does not occur. 

15 

Within the context of the present invention it has been found that the expression of a family of 
genes called plasticity-related genes (PRGs) overcomes the repellent effect of bioactive lipid 
phosphates, in particular of LPA and, thus, allows the regrowth of axons in spite of the pres- 
ence of bioactive lipid phosphates. Therefore, the present invention is directed at an isolated 

20 protein comprising the same or substantially the same amino acid sequence selected from the 
group consisting of human PRG-1, human PRG-2, mouse PRG-1, mouse PRG-2 and rat PRG- 
1 (depicted in SEQ ID NOs: 1 to 5), respectively, or a splice variant or a salt thereof. A pro- 
tein having substantially the same amino acid sequence comprises proteins with at least about 
95%, preferably at least about 96%, more preferably at least about 97%, more preferably with 

25 at least about 98% and most preferably with at least about 99% amino acid sequence identity. 
The amino acid exchanges are preferably so called conservative changes meaning substitu- 
tions of, for example, a polar amino acid residue by another polar amino acid residue, of a 
acidic amino acid residue by another acidic amino acid residue or of a basic amino acid resi- 
due by another basic amino acid residue. 

30 

Proteins having substantially the same amino acid within the meaning of this invention ex- 
hibit in a preferred embodiment lipid phosphate phosphatase activity The lipid phosphate 
phosphatase activity of a given protein with substantially the same amino acid can be tested, 
for example, by the ectophosphatase assay described in example 1 1 below. The proteins em- 



ployed in the assay can either be purified from cells or can be recombinantly expressed and 
purified by methods well known in the art. 



In one embodiment of the present invention the protein comprises at least one fragment of the 
5 human PRG-1 and PRG-2 or mouse PRG-1 and PRG-2 or rat PRG-1. A fragment within the 
meaning of the present invention refers to one of the proteins according to SEQ ID NOs: 1 to 
5 bearing at least one N-terminal, C-terminal and/or internal deletion. The resulting fragment 
has a length of at least about 50, preferably of at least about 100, more preferably of at least 
about 150, more preferably of at least about 200, more preferably of at least about 250, more 
10 preferably of at least about 300, more preferably of at least about 350 and most preferably of 
at least about 400 amino acids. 



Preferably the fragment is an N-terminal fragment which comprises 330 amino acids or less 
as outlined above, which are highly conserved between, for example, PRG-1 and members of 

15 the family of LPP membrane-associated phosphatic acid phosphatase ecto-enzymes, which 
have six membrane spanning domains with their active site located on the external surface of 
the plasma membrane. This domain comprises preferably the catalytic region. For example, 
human PRG-1 carries a catalytic histidine at position 252, which is involved in the phospha- 
tase activity of human PRG-1. Therefore, in a preferred embodiment any N-terminal fragment 

20 of the proteins of the present invention comprises the catalytic site including the conserved 
His-residue. The fragment itself has preferably an amino acid sequence identity with hPRG-1, 
hPRG-2, mPRG-1, mPRG-2 and rat PRG-1, respectively, of at least about 95%, preferably of 
at least about 96%, more preferably of at least about 97%, more preferably of at least about 
98%, more preferably of at least about 99% and most preferably of 1 00%. 

25 

The C-terminal cytoplasmatic part of the PRG proteins is potentially involved in regulation of 
lipid phosphate phosphatase activity and/or signaling and, thus, a further preferred fragment 
comprises a C-terminal fragment, which comprises about 413 amino acids or less as outlined 
above and which comprises regions required for above activity of the PRG proteins. The 
30 fragment itself has preferably an amino acid sequence identity with hPRG- 1 , hPRG-2, mPRG- 
1, mPRG-2 and rat PRG-1, respectively, of at least about 95%, preferably of at least about 
96%, more preferably of at least about 97%, more preferably of at least about 98%, more 
preferably of at least about 99% and most preferably of 100%. 
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In a further aspect the present invention is directed at a nucleic acid, which comprises at least 
one nucleic acid encoding one of the proteins of the present invention. Preferably the nucleic 
acid consists of DNA or RNA, wherein the DNA preferentially is either single or double 
stranded. Also comprised are DNA's, which hybridize to one of the aforementioned DNA's 
5 under stringent conditions like, for example, hybridization at 60°C in 2.5 x SSC buffer and 
several washes at 37°C at a lower buffer concentration like, for example, 0,5 x SSC buffer 
and which encode proteins exhibiting lipid phosphate phosphatase activity and/or association 
with plasma membranes. Additional reagents required for carrying out stringent Northern or 
Southern blots like, for example, single stranded salmon sperm DNA are well known in the 
10 art. 

In a preferred embodiment of the nucleic acid of the present invention the nucleic acid com- 
prises a nucleic acid selected from the group consisting of the human PRG-1 gene, the human 
PRG-2 gene, the mouse PRG-1 gene, the mouse PRG-2 gene and the rat PRG-1 gene (see 
15 SEQIDNOs:6tolO). 

In a further embodiment the nucleic acid of the present invention further comprises at least 
one promoter, enhancer, intron and/or polyA-sequence. Preferred promoters or enhancers 
posses tissue specificity, in particular neuronal specificity and more particular a specificity for 
20 growing neurons. Examples of such promoters and/or enhancers are the neuron specific eno- 
lase promoter (Erickson, R.P. and Bernard, O. (2002) J. Neuro Science Res 68:738-44), the 
peripherin promoter (Weinstein, D.E. et al. (1999) Brain Res. Dev. Brain Res. 1 16:29-39), the 
synapsin promoter (Flood, D.G. et al. (1999) Am. J. Pathol. 155:663-72) and the Thy 1 pro- 
moter (Kahle, P. J. et al. (2001) Am. J. Pathol. 159:2215-25). 

25 

In some instances it might be desirable to interfere with, for example, the transcription or 
translation of the nucleic acids of the present invention and, therefore, the present invention is 
also directed at a nucleic acid, which is complementary to the nucleic acid of the present in- 
vention and, thus, is capable of inhibiting, for example, transcription or translation. A pre- 
30 ferred embodiment of such a complementary nucleic acid is a so called anti-sense oligonu- 
cleotide (R. Q. Zheng and D. M. Kemeny (1995) Clin. Exp. Immunol. 100:380-2, W. Nellen 
and C. Lichtenstein (1993) Trends. Biochem. Sci. 18:419-423 and C. A. Stein (1992) Leuke- 
mia 6:967-74), ribozymes (M. Amarzguioui and H. Prydz (1998) Cell. Mol. Life Sci. 
54:1175-1202, N. K. Vaish et al (1998) Nucleic Acids Res. 96:5237-5242, Persidis (1997) 



Nat. Biotechnol. 15:921-922 and L. A. Couture and D. T. Stinchcomb (1996) Trends Genet. 
12:510-515) and/or so called small interfering RNA-molecules (siRNAs) (S. M. Elbashir et 
al. (2001) Nature 411:494-498). Anti-sense oligonucleotides are able to decrease the stability 
of the above described nucleic acids and/or can inhibit the translation. Similarly the use of 
5 siRNA-oligonucleotides also leads to a reduction in the amount of the translated polypeptides. 
The anti-sense oligonucleotides have in a preferred embodiment a length of at least 20, pref- 
erable of at least about 30, more preferably of at least about 40 and most preferably a length 
of at least about 50 nucleic acids. 

10 Oligonucleotides are generally rapidly degraded by endo- or exonucleases, which are present 
in the cell, in particular by DNases und RNases and, therefore, it is advantageous to modify 
the nucleic acids which are used, for example, in anti-sense strategies, as ribozymes or siR- 
NAs to stabilize them against degradation and thereby prolong the time over which an effec- 
tive amount of the nucleic acid is maintained within the cell (L. Beigelmann et al. (1995) Nu- 

15 cleic acids Res. 23:3989-94, WO 95/11910, WO 98/37340 and WO 97/29116). Typically 
such a stabilization can be obtained by the introduction of one or more internucleotide phos- 
phate groups and/or by the introduction of one or more non-phosphor-internucleotides. 

Suitable modified internucleotides are summarized in, for example, Uhlmann and Peimann 
20 (1990) Can. Rev. 90:544. Modified internucleotide phosphate residues and/or non-phosphate 
bridges which can be used in a nucleic acid of the invention comprise, for example, methyl- 
phosphonate, phosphorthioate, phosphoramidate, phosphordithionate, phosphate ester, non- 
phosphor internucleotide analogues, which can be used in nucleic acids of the invention in- 
clude, for example, siloxane bridges, carbonate bridges, carboxymethylester, acetamid bridges 
25 and/or thio ether bridges. 

A further aspect of the present invention is directed at a vector comprising a protein according 
to the present invention and/or a nucleic acid according to the present invention. A vector 
within the meaning of the present invention is a protein or a nucleic acid or a mixture thereof 
30 which is capable of being introduced or of introducing the proteins and/or nucleic acid com- 
prised into a cell. It is preferred that the proteins encoded by the introduced nucleic acid are 
expressed within the cell upon introduction of the vector. 
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In a preferred embodiment the vector of the present invention comprises plasmids, 
phagemids, phages, cosmids, artificial mammalian chromosomes, knock-out or knock-in con- 
structs, viruses, in particular adenovirus, vaccinia virus, lentivirus (Chang, L.J. and Gay, E.E. 
(20001) Curr. Gene Therap. 1:237-251), Herpes simplex virus (HSV-1, Carlezon, W.A. et al. 
5 (2000) Crit. Rev. Neurobiol.), baculovirus, retrovirus, adeno-associated-virus (AAV, Carter, 
P.J. and Samulski, R.J. (2000) J. Mol. Med. 6:17-27), rhinovirus, human immune deficiency 
virus (HIV), filovirus and engineered versions thereof (see, for example, Cobinger G. P. et al 
(2001) Nat. Biotechnol. 19:225-30), virosomes, "naked" DNA liposomes, and nucleic acid 
coated particles, in particular gold spheres. Particularly preferred are viral vectors like adeno- 

10 viral vectors or retroviral vectors (Lindemann et al. (1997) Mol. Med. 3:466-76 and Springer 
et al. (1998) Mol. Cell. 2:549-58). Liposomes are usually small unilamellar or multilamellar 
vesicles made of cationic and anionic lipids, for example, by ultrasound treatment of liposo- 
mal suspensions. The DNA can, for example, be ionically bound to the surface of the lipo- 
somes or internally enclosed in the liposome. Suitable lipid mixtures are known in the art and 

15 comprise, for example, DOTMA (1, 2-Dioleyloxpropyl-3-trimethylammoniumbromid) and 
DPOE (Dioleoylphosphatidylethanolamin) which both have been used on a variety of cell 
lines. 

Nucleic acid coated particles are another means for the introduction of nucleic acids into cells 
20 using so called "gene guns", which allow the mechanical introduction of particles into the 
cells. Preferably the particles itself are inert, and therefore, are in a preferred embodiment 
made out of gold spheres. 

In a further aspect the present invention is directed at an isolated cell comprising a protein of 
25 the present invention, a nucleic acid of the present invention and/or a vector of the present 
invention. Cells of the present invention can be prokaryotic or eukaryotic cells and in a pre- 
ferred embodiment the cells of the present invention are stem cells, in particular non-human 
embryonic stem cells, embryonic stem cell lines, foetal stem cells, adult stem cells, neuronal 
precursor cells or neuronal cells in particular axons (Hsich, G. et al. (2002) Hum. Gene 
30 Therap., 13:579-604 and Martinez-Serrano, A. et al. (2001) Curr. Gene Therap. 1:279-299). 

A further aspect of the present invention is a transgenic non-human animal generated from a 
cell of the present invention. The animal can be a mosaic animal, which means that only part 
of the cells making up the body comprise cells of the present invention or all cells of the ani- 



mal can be derived from a cell of the present invention and those animals can be either homo- 
or heterozygous with respect to the nucleic acid of the present invention contained within the 
cell of the present invention. In a preferred embodiment the transgenic animals are either 
homo- or heterozygous knock-out or knock-in animals with respect to the genes which code 
5 for the proteins of the present invention. 

In a further aspect the present invention is directed at an antibody directed against a protein of 
the present invention. The term "antibody" comprises monoclonal and polyclonal antibodies 
and binding fragments thereof, in particular Fc-fragments as well as so called "single-chain- 
1 0 antibodies" (Bird R. E. et al (1 988) Science 242:423-6) and diabodies (Holliger P. et al ( 1 993) 
Proc. Natl. Acad. Sci. U.S.A. 90:6444-8). 

In a further aspect the present invention is directed at a method of producing a protein of the 
present invention or a nucleic acid of the present invention and comprises the steps of: a) cul- 

15 tivating a cell of the present invention and b) isolating the protein and/or the nucleic acid. If 
the method is used primarily to isolate nucleic acids then in an preferred embodiment the 
cells, which are used are prokaryotic cells, in particular R coli. cells If the method is used 
primarily for the isolation of proteins of the invention than the cells can be either of prokary- 
otic or eukaryotic origin. Someone of skill in the art is aware of a variety of different cell 

20 types suitable for the production of proteins like, for example, E. coli, Sf9, Hi5, P. pastoris 
and HeLa. Eukaryotic cells are preferably chosen, if it is desired that the proteins produced by 
the cells exhibit an essentially natural pattern of glycosylation and prokaryotic cells are cho- 
sen, if, for example, glycosylation or other modifications, which are normally introduced into 
proteins only in eukaryotic cells, are not desired or not needed. 

25 

In a further aspect the present invention is directed at a method of isolating compounds inter- 
acting with a protein of the present invention comprising the steps of: a) contacting one or 
more of the proteins of the present invention, preferably one, with at least one potentially in- 
teracting compound, and b) measuring binding of said compound to said protein. This method 
30 is suitable for the determination of compounds that can interact with the proteins of the pres- 
ent invention and to identify, for example, inhibitors, activators, competitors or modulators of 
proteins of the present invention, in particular inhibitors, activators, competitors or modula- 
tors of the enzymatic activity of the proteins of the present invention. 



The potentially binding substance, whose binding to the protein of the present invention is to 
be measured, can be any chemical substance or any mixture thereof. For example, it can be a 
substance of a peptide library, a combinatory library, a cell extract, in particular a plant cell 
extract, a "small molecular drug", a protein and/or a protein fragment. 

The term "contacting" in the present invention means any interaction between the potentially 
binding substance(s) with the proteins of the invention, whereby any of the two components 
can be independently of each other in a liquid phase, for example in solution, or in suspension 
or can be bound to a solid phase, for example, in the form of an essentially planar surface or 
in the form of particles, pearls or the like. In a preferred embodiment a multitude of different 
potentially binding substances are immobilized on a solid surface like, for example, on a 
compound library chip and the protein of the present invention is subsequently contacted with 
such a chip. 

The proteins of the present invention employed in a method of the present invention can be 
full length proteins or a fragments with N/C-terminal and/or internal deletions. Preferably the 
fragments are either N-terminal fragments comprising the enzymatic region of the protein or 
C-terminal fragments comprising the cytoplasmic region, depending on whether potentially 
interacting compounds are sought that specifically interact with the N- or C-terminal fragment 

Measuring of binding of the compound to the protein can be carried out either by measuring a 
marker that can be attached either to the protein or to the potentially interacting compound. 
Suitable markers are known to someone of skill in the art and comprise, for example, fluores- 
cence or radioactive markers. The binding of the two components can, however, also be 
measured by the change of an electrochemical parameter of the binding compound or of the 
protein, e.g. a change of the redox properties of either the protein or the binding compound, 
upon binding. Suitable methods of detecting such changes comprise, for example, potenti- 
ometric methods. Further methods for detecting and/or measuring the binding of the two 
components to each other are known in the art and can without limitation also be used to 
measure the binding of the potential interacting compound to the protein or protein fragments 
of the present invention. The effect of the binding of the compound or the activity of the pro- 
tein can also be measured indirectly, for example, by assaying the phosphatase activity of the 
protein after binding. 



As a further step after measuring the binding of a potentially interacting compound and after 
having measured at least two different potentially interacting compounds at least one com- 
pound can be selected, for example, on grounds of the measured binding activity or on 
grounds of the detected increase or decrease of protein activity, in particular lipid phosphate 
5 phosphatase activity upon binding. The phosphatase activity can be measured, for example, as 
described in example 1 1 . 

The thus selected binding compound is than in a preferred embodiment modified in a further 
step. Modification can be effected by a variety of methods known in the art, which include 

10 without limitation the introduction of novel side chains or the exchange of functional groups 
like, for example, introduction of halogens, in particular F, CI or Br, the introduction of lower 
alkyl groups, preferably having one to five carbon atoms like, for example, methyl, ethyl, n- 
propyl, isopropyl, n-butyl, isobutyl, tert-butyl, n-pentyl or iso-pentyl groups, lower alkenyl 
groups, preferably having two to five carbon atoms, lower alkinyl groups, preferably having 

15 two to five carbon atoms or through the introduction of, for example, a group selected from 
the group consisting of NH 2 , N0 2 , OH, SH, NH, CN, aryl, heteroaryl, COH or COOH group. 

The thus modified binding substances are than individually tested with the method of the pre- 
sent invention, i.e. they are contacted with the protein and subsequently binding of the modi- 

20 fied compounds to the protein is measured. In this step both the binding per se can be meas- 
ured and/or the effect of the function of the protein like, e.g. the enzymatic activity of the 
protein can be measured. If needed the steps of selecting the binding compound, modifying 
the binding compound, contacting the binding compound with a protein of the invention and 
measuring the binding of the modified compounds to the protein can be repeated a third or 

25 any given number of times as required. The above described method is also termed "directed 
evolution" since it involves a multitude of steps including modification and selection, 
whereby binding compounds are selected in an "evolutionary" process optimizing its capa- 
bilities with respect to a particular property, e.g. its binding activity, its ability to activate, 
inhibit or modulate the activity, in particular the phosphatase activity of the proteins of the 

3 0 present invention. 



A further aspect of the present invention is a method of isolating compounds functionally in- 
teracting with the activity of the proteins of the present invention comprising the steps of: a) 
contacting a neuronal cell that comprises a wt nucleic acid coding for a protein selected from 
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the group consisting of SEQ ID NOs: 1 to 5, a splice variant thereof, or a fragment thereof 
with a potential functional interactor, b) contacting the cell with a bioactive lipid phosphate, 
and c) measuring neurite movement or phosphatase activity. 

5 The term "contacting" has to be understood as previously defined and comprises any possi- 
bility of interaction between the potential functional interactor and a neuronal cell. Contacting 
also comprises the introduction of the potential functional interactor into the neuronal cell 
which can be effected by a variety of methods including, for example, electroporation, which 
allows influx of a potential functional interactor contained in the medium surrounding the 

10 neuronal cell into the neuronal cell. A neuronal cell that comprises a wt nucleic acid coding 
for a PRG protein (as indicated in SEQ ID NOs: 1 to 5) can be any neuronal cell capable of 
neurite movement. The cell may or may not express the wild-type nucleic acid depending on, 
for example, the developmental stage of the neuronal cell. For example entorhinal cortex cells 
of embryonic day 16 (El 6), which exhibit neurite movement, do not express PRG-1 while 

15 entorhinal cortex cells of postnatal day 0 (P0) do express PRG-1. The choice of either a cell 
that already expresses or does not express the wild-type nucleic acids will depend on the 
functional interaction of the potential functional interactor that is sought. If, for example, a 
functional interactor is sought, that activates transcription within the cell normally not ex- 
pressing PRG-1 than, for example, an El 6 cell could be chosen. If functional interactors are 

20 sought that primarily interact on the protein level, i.e. that activate or suppress phosphatase 
activity of already expressed PRG-1 than neuronal cells expressing PRG-1 would be chosen 
like, for example, P0 cells. 

Bioactive lipid phosphates are lipid phosphates which inhibit neurite movement of neuronal 
25 cells, which do not express PRG-1 cells, like for example, E16 cells. Examples of such bioac- 
tive lipid phosphates comprise PA, LP A and S-l-P. Whether a lipid phosphate, which can be 
used in the method of the present invention is bioactive can be determined by, for example, 
the experiment described in Example 9. 

30 Methods for measuring neurite movements are well-known in the art and are described, for 
example, in Savaskan et al. (1999) European J. Neurosci. 1 1 319-326. One way of measuring 
neurite movement is scoring of the cells after contacting with a bioactive lipid phosphate. It is 
possible to categorize the cells into at least three different categories, i.e. round cells, cells 
with short processes and cells with long processes. It is also possible to quantify the effect by 
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measuring the length of the neurite processes. The effect of the potential functional interactor 
can be determined by comparing the neurite movement of the neuronal cell, the categories the 
cells are in or the length of the neurite processes after contacting the cell with a bioactive lipid 
phosphate with or without the functional interactor. Other ways to assess the effect of poten- 
5 tial functional interactors are the determination of the expression level of the PRG genes 
and/or proteins or the enzymatic activity of the proteins. 

In a further embodiment the method includes the additional steps of: a) contacting a neuronal 
ceil that comprises a mutant nucleic acid coding for a mutant of the proteins selected from the 

10 group consisting of SEQ ID NOs: 1 to 5, a splice variant thereof, or a fragment thereof or that 
contains a knock-out of the wt nucleic acid coding for one of said proteins with a potential 
functional interactor, b) contacting said cell with a bioactive lipid phosphate, and c) measur- 
ing neurite movement. The above described neuronal cell is preferably incapable of express- 
ing a functional PRG protein and, thus, can not be stimulated by any functional interactor to 

15 activate PRG genes or PRG protein function. Therefore, any potential functional interactor, 
which shows an effect on the neurite movement of neuronal cells comprising wild-type PRG 
genes but shows no effect in the neuronal cell comprising mutants or knock-out PGR genes 
have thereby been shown to functional interact with the PRG genes. Once such an interactor 
has been identified the mode of functional interaction can be further analyzed and to that end 

20 the amount of PRG mRNA and/or protein expressed or the activity of the PRG protein can be 
determined by a variety of different techniques, which are known in the art. 

In a preferred embodiment the method of the invention comprises the further steps of: a) 
modifying the functional interactor to generate a variety of modified functional interactors, b) 

25 contacting a neuronal cell comprising a wild-type nucleic acid coding for a protein selected 
from the group consisting of SEQ ID NOs: 1 to 5, a splice variant thereof, or a fragment 
thereof and if needed a cell that comprises a mutant nucleic acid coding for a mutant of the 
protein selected from the group consisting of SEQ ID NOs: 1 to 5, a splice variant thereof, or 
a fragment thereof with the modified functional interactors, c) contacting said cell or cells 

30 with a bioactive lipid phosphate, d) measuring the neurite movement, and e) if needed re- 
peating steps a) to d) for one or more times. The modification of the functional interactor can 
be any of the modifications outlined above with respect to the modification of an interacting 
compound and the modification and selection steps can be repeated one or several times until 



-12- 



a functional interactor has been selected that shows the desired functional interaction, e.g. 
repression or activation of the activity in particular of the enzymatic activity of PRG proteins. 

In a further embodiment of the method of the present invention the interacting compound 
5 identified as outlined above or the functional interactor identified as outlined above, which 
may or may not have gone through additional rounds of modification and selection, is ad- 
mixed with suitable auxiliary substances and/or additives. Such substances comprise pharma- 
cological acceptable substances, which increase the stability, solubility, biocompatibility, or 
biological half-life of the interacting compound or the functional interactor or comprise sub- 
10 stances or materials, which have to be included for certain routs of application like, for exam- 
ple, intravenous solution, sprays, Band- Aids or pills. 

According to a further aspect of the present invention a pharmaceutical composition is pro- 
vided for the treatment of neuronal injuries or diseases comprising a protein of the invention, 
15 a nucleic acid of the invention, a vector of the invention, a cell of the invention, an antibody 
of the invention, a binding component isolated by a method of the invention and/or a func- 
tional interactor isolated by a method of the invention and if needed suitable auxiliary sub- 
stances and/or additives. 

20 Accordingly, a further aspect of the present invention is the use of a pharmaceutical composi- 
tion of the invention for the production of a medicament for the treatment of neuronal dis- 
eases or injuries. Neuronal diseases which can be treated with the pharmaceutical composition 
comprise spinal cord lesion, Alzheimer disease and stroke. Typical neuronal injuries comprise 
traumata of any sort in particular head traumata resulting in the damage of neurons and in 

25 particular the severing of neuronal connections. 

As stated above it has also been found that PRG proteins are differentially expressed in cer- 
tain tissues and that they have been found to be differentially expressed in certain diseases, 
however, differential expression is also associated with certain disease states. Thus, PRG 
30 proteins present attractive targets for diagnosis and treatment of a variety of diseases. There- 
fore, another aspect of the present invention is the use of the proteins or nucleic acids of the 
present invention as diagnostic marker for the diagnosis of a disease or disease state, whereby 
the presence, the absence, or the amount of PRG proteins is evaluated by, for example, im- 
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munological methods, RT-PCR, Northern blot. For the immunological detection and/or quan- 
tification methods the antibodies of the present invention can be used. 

As PRG is differentially regulated in neuronal diseases PRG proteins or nucleic acids are in a 
5 preferred embodiment used as diagnostic marker for the diagnosis of neuronal diseases. 

Furthermore PRG proteins are overexpressed in tumors. PRG-1, for example, is overex- 
pressed in a variety of tumor cells, e.g. astroglioma WHO Grad III-IV, neuroblastoma, kidney 
cell carcinoma, myoblastoma and ovarial cell carcinoma, and in particular it has also been 

10 found to be overexpressed in migrating, i.e. metastasizing cancer cells which have lost the 
anchorage dependence for growth. Therefore, PRG proteins also present attractive targets for 
diagnoses and treatment of tumors, in particular metastasizing tumors. Consequently, a further 
aspect of the present invention is the use of PRG proteins and nucleic acids of the present in- 
vention as a tumor markers and, preferably, as metastatic markers. Because of the above 

15 property PRG proteins are also therapeutic targets for the development of drugs, which 
modulate, preferably inhibit the function of PRG proteins. Such drugs can be identified with 
above-described methods for identifying interacting or functional interacting compounds. 

In a preferred embodiment, PRG proteins of the present invention are used for diagnosis of 
20 cancers selected from the group of cancers consisting of neuroblastoma, astroglioma, ovarial 
cell carcinoma, prostatic cell carcinoma and breast cell carcinoma. 

In addition, PRG proteins, in particular PRG-1, have been found to be overexpressed in dif- 
ferentiating sperm cells and, thus, a further embodiment of the present invention is the use of 

25 the PRG proteins or nucleic acids as diagnostic targets for the diagnosis of infertility, in which 
a lack of PRG proteins would indicate either a low amount of differentiating sperm cells or 
the malfunction of the sperm cells. The amount of expression of PRG proteins or nucleic ac- 
ids can be detected by, for example, in situ immunofluorescence, in situ Northern blots. The 
skilled persons knows a variety of additional methods that are suitable to determine the 

30 amount and distribution of PRG proteins and RNA within a cell ansd/or tissue. 



The following figures and examples merely serve to illustrate the invention and should not be 
construed to restrict the scope of the invention to the particular embodiments of the invention 
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described in the examples. All references cited in the text and the disclosure of the priority 
application EP 02 020 679.3 are hereby incorporated in their entirety by reference. 

Figures 

Fig. 1 Human PRG-1 amino acid sequence (SEQ ID NO. 1). The first 300 amino acids are 
highly conserved among LPP family members. The other 400 amino acids (gray 
boxed sequence) of PRG-1 show no homologies to known sequences. The catalytic 
histidine (His-252) conserved in all members of the LPP-super family is marked 
with an asterisk. 

Fig. 2 Hydrophobicity profile of PRG-1 protein predicted by the Kyte and Doolittle algo- 
rithms. The numbers at the bottom of the profile refer to amino acid residues from 
the amino terminus. The gray shaded area of PRG-1 is predicted as hydrophilic and 
located in the cytosol. 

Fig. 3 Northern blot analysis of PRG-1 mRNA expression 

Fig. 4 Expression pattern of PRG-1 mRNA in the developing and lesioned rat brain de- 
tected by in situ Northern blot. Panel A shows the Toluidene blue staining of a 
brain section on embryonic day 16 (E16). Panel B shows the in situ hybridization 
signal with a probe specific for PRG-1 mRNA of the same section as shown in 
panel A. Panel C shows a Toluidene blue staining of a brain section on embryonic 
day 19 and panel D shows the in situ hybridization with a probe specific for PRG-1 
mRNA of the same section as shown in panel C. Panels E through K show in situ 
Northern blot analysis of brain sections at days 0, 5, 10, 15, 30 after birth, in an 
adult and one day after lesion, respectively. The scale bar in El 9 equals 850 ^im. 
The scale bar in P30 equals 740 \xm and also applies to P0-P15. The scale bar in 
ldal equals 500 fim and also applies to adult. "LV" means lateral ventricle, "LP" 
means lateral posterior thalamic nucleus, "LD" means laterodorsal thalamic nu- 
cleus, "bcp" means basal telencephalic plate, posterior part, "CA1" means cornu 
ammonis, "DG" means dentate gyrus, "RSG" means retrosplenial granular cortex, 
"dal" means days after lesion. 
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Fig. 5 Quantification of the in situ hybridization signals in different regions of the brain 
after lesion, "gel" means granule cell layer, "CA3" means cornu ammonis 3, "CA1" 
means comu ammonis and "dal" means days after lesion. 

5 Fig. 6 Overexpression of a PRG- 1 -eGFP fusion protein in COS-7 cells. Panel A shows the 
fluorescence of the green fluorescent protein, panel B shows the fluorescence of a 
fluorecently labeled anti-PRG-1 peptide antibody and panel C shows the co- 
localization of the fluorescence of the PRG- 1 -eGFP and the anti-PRG-1 antibody. 
In each picture the processes of the COS-7 cells are marked with a white arrow. 
1 0 The scale bar depicts a length of 1 0 um. Panel D shows the result of a Western blot 

using an antiserum raised against a C-terminal peptide of PRG-1 . 

Fig. 7 Immunocytochemical analysis of PRG-1 in the adult rat hippocampus prior and 
after lesion. Pyramidal neurons are labeled in the CA1 and CA3 region. Polymor- 
1 5 phic cells are stained in the hilus. The outermoelcular layer is marked with black er- 

rors. "Gel" means granule cell layer "Oml" means outer molecular layer and "hi" 
means hilus. The scale bar in panel A and B equals 580 um. Panel C shows immu- 
noblots from total protein extracts of adult control and deafferentiated hippocampus 
five days after lesion. 

20 

Fig. 8 Panel A shows a higher magnification of the boxed area of panel B of Fig. 7. Im- 
muno-stained axons are marked with black arrows and the terminal branches with 
white arrows. Panel B shows an electron micrograph of a PRG-1 immunopositive 
axon. The immunopositive axon is delineated by black arrows, while its terminal 
25 branch is delineated by gray arrows, "oml" means outer molecular layer and "ax" 

means axon and "s" means spine. The scale bar in panel A equals 20 um and the 
scale bar in panel B equals 0.4 um. 

Fig. 9 Panel A-C show the result of an RT-PCR analysis of RNA from tissues from El 6 
30 and P0 explants. The amplification was carried out with primers specific for PRG-1 

(panel A), EDG 2, 4 or 7 (panel B) and 0-actin (panel C). Panel D and E depict ex- 
plants from rat entorhinal cortex at embryonic day 16 and panel F and G depict rat 
entorhinal cortex at postnatal day 0 (bottom row of panels). Outgrowing axons are 
marked with white arrows and the panels D and F show neurite retraction in the 
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presence of vehicle (0,9% NaCl) while panels E and G show neurite retraction upon 
the addition of 100 nmol/1 LP A. The scale bar in panel G equals 20 um. 

Fig. 10 Panel A shows the neurite outgrowth length in control cultures (left bar) and LPA 
5 treated cultures, (right bar) in El 6 and postnatal explants (PO) in urn. Panel B 

shows the dose-response of LPA on P0 and El 6 explants. 

Fig. 1 1 Depicts cell rounding and neurite retraction in response to LPA in N1E-1 15 cells. 

The cells depicted in the first three top panels were transfected with a plasmid con- 

10 taining only eGFP and were not treated with LPA but with vehicle only. The cells 

in the second row were transfected with a plasmid containing only eGFP but were 
treated with 10 umol/1 LPA, the cells of the third row were transfected with a plas- 
mid coding for a PRG-l-eGFP fusion protein and were treated with 10 umol/1 LPA 
while the cells depicted in the bottom row of panels were transfected with a mutant 

j 5 prg- 1 -eGFP fusion protein, which carried a His-Ly s exchange in the catalytic his- 

tidine of PRG- 1 (PRG-l His/Lys ) and were also treated with 10 umol/1 LPA. The pan- 
els on the left show transfected cells, panels in the middle show nuclear staining 
(Hoechst staining) and panels on the right show merged images with f-actin stain- 
ing. 

20 

Fig. 12 Panel A shows the quantification of the results shown in Fig. 11 and of a similar 
experiment performed with wild type N1E-1 15 cells (wt). Panel B shows the results 
of RT-PCR using LPA receptor specific primers to assess LPA receptor expression 
in wild type N1E-115 cells (wt) or PRG-1 transfected N1E cells (PRG-1). Panel C 

25 shows the results of RT-PCR using p-actin specific primers to assess p-actin ex- 

pression in wt N1E-1 1 5 cells and in PRG-1 transfected cells. Panel D shows a dose 
response of LPA treatment on neurite length of PRG-1 overexpression cells. Panel 
E shows cell viability of N1E-115 cells 48-72 h after transfection. Panel F shows 
the phosphatase activity of intact cells overexpressing eGFP, PRG-l-eGFP, and 

30 PRG-l His/Lys . 



Fig. 13 Schematic diagram of the proposed axon growth mechanism in a lipid phosphate 
lipid rich environment. "R" means receptor mediating a retraction signal. 
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Fig. 14 Depicts the proposed structure of the human PRG-1 wherein those parts presumed 
to be arranged on the extracellular surface inserted in the plasma membrane and 
protruding into the cytoplasm are shown. 

5 Examples 

1. Isolation of PRG-1 
Animals and surgery 

10 All animals were housed under standard laboratory conditions, and the surgical procedures 
were performed in agreement with the German law (in congruence with 86/609/EEC) for the 
use of laboratory animals. All efforts were made to minimize the number of animals used, and 
all surgical procedures were performed under sufficient anesthesia to minimize animal suf- 
fering. The experimental procedures are described in detail in Brauer et al (2001) FASEB J. 

15 15:2689-2701. 

Substraction cDNA library and differential screening 

The SMART cDNA technology from Clontech was used to generate high yields of full- 
length, double-stranded cDNA from adult, control and lesioned hippocampus rat RNA. To 
20 develop the substraction library, the Clontech PCR-Select cDNA substraction Kit (Heidel- 
berg, Germany) was used. Each clone was dotted in duplicates on Hybond N filters (Amer- 
sham, Germany) and screened with randomly radioactively labeled cDNA (Prime-A-Gene, 
Promega, Germany) from adult non-lesioned and lesioned hippocampus. 

25 2. Analysis of the sequence of PRG-1 

Similar to other members of LPP-family a hydrophobicity analysis of PRG-1 predicts 6 N- 
terminal membrane-spanning regions with a highly conserved phosphatase domain. The 
analysis was done using the DNAsis for Windows Version 2.6; Hitachi Software Engineering 
Co. Hydrophobicity Analysis Submenue using the Kyte & Doolittle algorithm with all set- 

30 tings set to default values. However, unlike the other members of this family the second type 
of the protein consists of a long hydrophilic domain of around 400 amino acids (Fig. 2). Ac- 
cording to the structural models of LPP orientation in the membrane, this C-terminal exten- 
sion is positioned on the cytoplasmatic site and might thus play a role as a regulatory or signal 
transduction domain. Beside the homology of the N-terminal part of the PRG-1 to other 



- 18- 



members of the LPP-family such as LPP-1 and the Drosophila cell migration modulator 
Wunen, GenBank searches revealed only one other related gene (genomic DNA sequence: 
GenBank acc. # NP_01 1255.1 1) for which we cloned the complete cDNA sequence and 
named it PRG-2. This gene shares the same C-terminal extension with partial sequence ho- 
mology. Thus, these genes represent a novel distinct subclass of the LPP-1 family. Amino 
acid residues which have been shown to be essential for ecto-enzyme activity in the LPP-1 
class of proteins are conserved in PRG-1 N-terminal sequences (Fig. 1). Database analysis of 
the C-terminal domains did not detect any significant similarities to any other protein or any 
other matches with known conserved domains (using ProDom and Swiss-Prot databases). A 
GenBank search for orthologous proteins showed that both genes are highly conserved in 
mammals (human/mouse > 93%), and partial EST sequences indicate orthologous proteins in 
Xenopus and Zebrafish, whereas no significant homology for the C-terminal part could be 
found in the Drosophila or other invertebrate genome. 

3 . Nort hern blot analysis of PRG- 1 exp ression 

The RNA isolation was described before 20 ug of total RNA from six adult control and six 1 
dal animals were loaded on a 1% agarose gel containing formaldehyde, transferred to Hy- 
bond™ - N s (Amersham Life Science, UK) and crosslinked by ultraviolet irradiation. As a 
probe for PRG-1, the full length cDNA clone as well as the C-terminal coding region was 
used. As probe for P-actin (as control for mRNA integrity and amounts of mRNA loaded), a 
cDNA fragment amplified by RT-PCR was used. Primer for the amplification of the control 
gene P-actin were: p-actin 5' (5' - CAC CAC AGC TGA GAG GGA AAT CGT GCG TGA - 
3', SEQ ID No. 1 1) spanning bases 2395 - 2424, and p-actin 3'-primer (5' - ATT TGC GGT 
GCA GCA TGG AGO GGC CGG ACT - 3', SEQ ID no. 12) complementary to bases 3095 - 
3124, with an amplificate length of 520 bp for rat p-actin cDNA (GenBank accession no. 
J00691). PCR was performed in 25 ul final volume containing 1 mmol/I dNTPs (Pharmacia 
Biotech, Germany), 2.5 units Taq Polymerase (Perkin Elmer, USA) 2.5 ul 10 x buffer in- 
cluding 2.5 mol/1 MgCl 2 (Perkin Elmer, USA), 10 umol/1 each primer and 1 ul cDNA using a 
Thermo-Cycler PTC-100 (MJ Research, Inc., USA). The cycle program was: 95°C, 2 min; 35 
x [94°C, 30 s; 70°C, 30 s 72°C, 2 min] and 10 min, 72°C. Both probes were labeled with the 
Prime-a-Gene Labelling System (Promega, USA) and [ 32 P] dCTP (DuPont NEN, USA). Hy- 
bridization was performed in 10 ml hybridization solution (250 mmol/1 sodium-phosphate, pH 
7.2, 7o/o SDS, 0.5 mmol/1 EDTA, 1% BSA) at 60°C for 12 h. The membrane was washed in 2 
x SSC at RT, 0.2 x SSC at RT, and 0.2 x SSC at 40°C for 30 min each. Membranes were ex- 
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posed to Kodak X-OMAT AR X-ray films at -80°C for 12 h, using an intensifying screen. 
Northern blot analysis revealed one distinct band which migrated around 5,5 kb. Expression 
of this mRNA was CNS-specific with the exception of a weak expression in testies (see Fig. 
3). Thus, PRG-1 is a novel vertebrate specific protein selectively located in the brain with 
5 putative phosphatase function. 



10 



4. In situ hybridization analysis of PRG-1 expression 

For hybridization, an antisense oligonucleotide (5'- GCA GAG GTC TGA ATT CTA GTG 
TCT ATC GTT ATA GTT CCT TAA CAG TGT GGG -3', SEQ ID No. 13) complementary 
to bases 425 - 475 of a rat EST cDNA clone (GenBank acc. AW 526088.1 was used). The 
oligonucleotide was synthesized by Metabion (Munich, Germany). The specificity was con- 
firmed by a BLAST GenBank search to rule out cross-hybridization to other genes. The pro- 
tocol was used as described by Brauer et al (2002) supra. 

15 The in situ hybridization analysis highlighted the tight regulation of PRG-1 transcription in 
the developing hippocampus. At embryonic day 16 (E16), no PRG-1 transcripts could be de- 
tected in the brain (Fig. 4, panel B). An expression signal first appears at day 19 (El 9) in the 
subventricular zone and specifically in the hippocampal anlage, whereas other cortical regions 
did not show PRG-1 expression (Fig. 4, panel D). From postnatal stages on, PRG-1 mRNA is 

20 present in the hippocampus and in the entorhinal cortex throughout adult stages (see Fig. 4, 
panels E-J). In the dentate gyrus, a region bearing postnatally developing granule cells, weak 
PRG-1 mRNA expression is found in the infrapyramidal blade at P0, whereas the later devel- 
oping suprapyramidal blade first showed expression signals at P5. This expression pattern 
remains unchanged during maturation, however, a reduced expression is apparent in the adult 

25 brain. 



5. PRG-1 mRNA expression after entorhinal cortex lesion 

The treatment of the animals, the surgery the construction of the subtraction cDNA library 
and the screening were carried out as described in example 1. The in situ hybridization was 
30 carried out as described in example 4. PRG-1 is upregulated one day after lesion (dal) and 
peaks at 5 dal in the ipsilateral hippocampus (gel = 37%, hilus 300%, CA1 = 100%, CA3 = 
60%). The contralateral hippocampus (maximum by 1 dal, gel = 16%, hilus = 200%, CA1 = 
59%, CA3 = 46%), as well as the ipsilateral cortex, shows a strong upregulation of PRG-1 
mRNA (maximum by 1 dal 83%) (see Fig. 5). 
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6. Transfection of a PRG-l-eGFP construct 

5 

Antibody generation and immunohistochemistry 

To design a peptide antibody against PRG-1, a sequence in the hydrophilic C-terminal region 
was used. The peptide (NH 2 -CVGVNGDHHVPGNQ-COOH, SEQ ID No. 14), representing 
amino acids 490 - 507 of the PRG-1 rat sequence (GenBank acc. SEQ ID No. 5), was synthe- 

10 sized by BioGenes (Berlin, Germany). The amino-terminal cysteinyl residue, which is not 
part of the PRG-1 sequence, was included for conjugation of the peptide to a carrier protein. 
The peptide was conjugated through the cysteinyl sulfhydryl to maleimide activation (keyhole 
limpet hemocyanin). Rabbits were immunized by BioGenes. The specificity of the peptide 
antibody was further tested on Western blot and on brain sections by blocking via peptide 

1 5 incubation prior to adding the antiserum. The protocol for the immunohistochemistry is de- 
scribed in detail by BrSuer et al (2001) Neuroscience 102:515-526. 

Western Blot analysis 

For Western Blot analysis, rat adult and 5 dal hippocampus extracts were separated on a 12% 
20 SDS/PAGE and electroblotted to nitrocellulose membranes (Millipore, Germany). All incu- 
bation was done overnight at 4°C in PBST. The PRG-1 antiserum was used at a 1:2000 dilu- 
tion. Secondary anti-rabbit antibody coupled with horseradish peroxidase was used at a 
1 :5000 dilution, and visualized by incubation in ECL detection reagents (Amersham Pharma- 
cia, Germany). The protocol for the immunocytochemistry is essentially the same as de- 
25 scribed in detail in Brauer et al. (2001) supra]. 

The immunoblot in Fig. 6D showed a single band obtained by incubation with the anti-PRG-1 
antiserum. The absence of specific signal in the preimmune serum prior to immunization is 
noteworthy. Immunoblots from total protein extracts of adult control and deafferentiated hip- 
30 pocampus shown in Fig. 7C demonstrated an increase five days after lesion (5dal). Data rep- 
resents three separate experiments in each group. Statistical difference is marked with an as- 
terisk (mean ± S.D.), * P < 0.05; Mann- Whitney- tMest. 

Subcellular localization 
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PRG-1 tagged with the eGFP reporter gene was used to identify the subcellular localization. 
Golgi apparatus was visualized with the cell tracker BODYPY ceramide (Molecular Probes, 
Netherlands). The staining protocol was obtained from molecular Probes. 

5 The transfection studies using a PRG-1 construct tagged with a eGFP reporter gene revealed 
that PRG-1 protein was processed in COS-7 cells through the Golgi apparatus (data not 
shown) to its final localization in the plasma membrane of small processes (Fig. 6, panel A). 
To localize the PRG-1 proteins in vivo an antiserum against a peptide from the cytoplasmic C- 
terminus of PRG-1 was raised. This antiserum specifically stained transfected COS-7 cells, 

10 which expressed PRG-1 -eGFP fusion proteins (Fig. 6, panel A-C) and detected a specific 
band in Western blot analysis (Fig. 6, panel B). Both the immunostaining and Western blot 
signal could be blocked by specific peptide incubation prior to the addition of antiserum (data 
not shown). The fluorescence of the labeled entire-PRG-1 peptide antibody and the fluores- 
cence of the eGFP part of the fusion protein colocalized in COS-7 cells and in the processes 

1 5 (see white arrows in panels A-C of Fig. 6). 

7. Expression analysis by RT-PCR 

Tissue from the retraction assays, E16 and P0 explants and N1E-115 cells were used for 
mRNA isolation and as templates for RT-PCR. cDNA from testis was used as a positive con- 

20 trol. The MidiMACS mRNA Isolation Kit (Miltenyi Biotec, Germany) was used to isolate 
mRNA from the explants or cells. Reverse transcription was performed as described by 
Brauer et al. (2000) Hippocampus 10:632-644. PCR was performed with the following prim- 
ers: PRG-l-5'-primer (5'-CTA GGC TTG TAG CTG TGG GGA ATT TC-3\ SEQ ID No. 
15), spanning bases 896 bp -921 bp, and PRG-1 -3'primer (5'-TCA ATC CTT ATA AGC 

25 CCG TGT G-3\ SEQ ID No. 16) complementary to bases 2202 bp - 2225 bp with an ampli- 
fication length of 1329 bp of the PRG-1 cDNA (SEQ ID No. 10). For amplification of the 
EDG receptor cDNA, the primer EDG-2-5' primer (5'-GAA CTT TGC GAG TGA GCT GG- 
3\ SEQ ID No. 17), spanning bases 836 bp -855 bp, and EDG-2-3 'primer (5'TGC GGA 
GAG CTT TAA CCT CC-3\ SEQ ID NO. 18), complementary to bases 1 165 bp - 1 184 bp 

30 with an amplification length of 348 bp of the EDG-2 cDNA (GenBank accession no. 
NM010336), the primer EDG-4-5 'primer (5'-CCT ACC TCT TCC TCATGT TC-3', SEQ ID 
No. 19), spanning bases 344 bp - 363 bp, and EDG-4-3 'primer (5 'TAA AGG GTG GAG 
TCC ATC AG-3', SEQ ID No. 20), complementary to bases 1 199 bp - 1 148 bp of the EDG-4 
cDNA (GenBank accession no. NM020028), the primer EDG-7-5' primer (5'-GGA ATT GCC 



22 



TCT GCA ACA TCT-3', SEQ ID No. 21), spanning bases 673 bp - 693 bp, and EDG-7- 
3'primer (5'-GAG TAG ATG ATG GG TTC A-3\ SEQ ID No. 22), complementary to bases 
1096 bp - 1054 bp of the EDG-7 cDNA (GenBank accession no. NM022983). PCR was per- 
formed using a Thermo-cycler PTC-100 (MJ Research, Inc.) in 25 pi final volume containing 

5 10 umol/1 dNTPs (Pharmacia, Germany), 2.5 units Taq Polymerase (Stratagen, Germany), 2.5 
pi 10 x buffer including 2.5 mol/1 MgCl 2 (Stratagen, Germany), 10 pmol/1 of each primer, and 
2 pi of each cDNA for all molecular analysis. For all EDG receptors, as well as for PRG-1 
amplification, the cycle program was: 2 min at 95°C, 40 x (94<>C, 30 sec; 52»C, 30 sec; and 
72°C, 1 min), and 5 min at 72°C. Amplification of P-actin cDNA was performed as described 

10 by Brauer et al. (2000) Hippocampus 10:632-644. 

g Tmm,,nncvtochemi ™1 fining of rat hippocampus 

The antibody described in experiment 6 was also used for the immunocytochemical staining 
and Western blot analysis. For Western blot analysis, rat adult and 5dal hippocampus extracts 
1 5 were separated on a 12% SDS/PAGE and electroblotted to nitrocellulose membranes (Milh- 
pore Germany). All incubation was done overnight at 4°C in PBST. The PRG-1 antiserum 
was used at a 1:2000 dilution. Secondary anti-rabbit antibody coupled with horseradish per- 
oxidase was used at a 1 :5000 dilution, and visualized by incubation in ECL detection reagents 
(Amersham Pharmacia, Germany). The protocol for the immunocytochemistry is essentially 
20 the same as described in detail in Brauer et al., 2001 (FASEB J, 15, 2689-2701). The immu- 
nocytochemical staining of rat hippocampus revealed that PRG-1 was specifically expressed 
in neurons (see Fig. 7, panel A) and in particular pyramidal neurons are labeled in the CA1 
and CA3 region, polymorphic cells are stained in the hilus and granule cells of the dentate 
gyrus are also immunopositive. However, the outer molecular layer, the termination zone of 
25 afferents from the entorhinal cortex (Skutella T. and Nitsch R. (2001) Trends. Neuroscience 
24 107-163) showed no PRG-1 positive fibers (Fig. 7, panel A). Conversely five days after 
entorhinal cortex lesion, a clear immunoreactive PRG-1 positive band appeared in the dener- 
vated outer molecular layer (Fig. 7, panel B). Western blot analysis revealed a 50% increase 
in PRG-1 expression in the denervated hippocampus (Fig. 7, panel C). PRG-1 immunostain- 
30 ing highlighted single axonal processes in the outer molecular layer which form terminal 
branches (Fig. 8, panel A). The higher magnification of an area from the immuno stained ax- 
ons showed that PRG-1 is indeed localized in the growth cone-like axon structures in the de- 
nervated zones of the hippocampus (see Fig. 8, panel B). 
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Subcellular localization 

5 PRG-1 tagged with the eGFP reporter gene was used to identify the subcellular localization. 
Golgi apparatus was visualized with the cell tracker BODYPY ceramide (Molecular Probes, 
Oregon). The staining protocol was obtained from Molecular Probes. 

9. Effect of PRG-1 expression on the LPA response of neurons 

10 

LPA induced neurite retraction in explants 

Entorhinal explants from El 6 and P0 rat pups were obtained from timed-pregnant Wistar rats 
and were cultivated as described (N. E. Savaskan (2000) Eur. I. Neurosci 12:1024-1032). In 
brief, entorhinal cortex was carefully dissected from the hippocampal anlage and the menin- 

15 ges were removed. Explants were gently transferred with a fire-polished Pasteur pipette into 
12-well plates and cultivated on baked glass cover slides coated with laminin and poly-L- 
lysin (25 ug/ml and 10 ug/ml, respectively) in culture medium containing selenium-defined 
fetal bovine serum [5%] (N. E. Savaskan et al (2000) FASEB J. e-published) (Neurobasal 
medium plus 25 ug/ml Penicillin/Streptomycin; B-27 supplement). After 24 h, culture me- 

20 dium was exchanged and cultivation was further performed in serum-free Neurobasalmedium 
for 20 h. Serum-starved explants were treated with 100 nmol/1 oleoyl-LPA (5 mmol/1 stock 
solution in ultra-filtrated water) for 10 min or with vehicle (0,9% NaCl) and then fixed in 4% 
paraformaldehyde for 20 min. For F-actin staining, fixed tissues were incubated with TRITC- 
phalloidin (0.1 ug/ml, Sigma, Germany) for 40 min, followed by incubation with HOECHST 

25 33258 dye (1:20,000, Sigma, Germany) for 5 min at room temperature. After three washing 
steps in PBS, explants were coverslipped with ImmunoMount (Merck, Germany) prior to 
analysis. Images were taken with a CCD camera on an Olympus BX-50 microscope and 
quantification was performed using the Meta Morph analysis system (Universal Imaging, 
PA). For statistical analysis Statview II was used (Abacus, USA). 

30 

A dose response of LPA neurite length of El 6 and P0 explants was done as described above, 
however, with 0, 0.1, 1.0 and 10 umol/1. 
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Entorhinal explants obtained at day 16 (El 6) do not express PRG-1 while postnatal explants 
(PO) express PRG-1 (see Fig. 9, panel A). In contrast, the LPA-specific receptors EDG-2/4/7 
were equally expressed in both embryonic and postnatal explants (see Fig. 9B). The control 
RT-PCR with p-actin showed the integrity of the RNA from E16 and PO cells. Both embry- 
5 onic and postnatal explants grow equally well under serum-free culture conditions and show 
long extending axons (see Fig. 9, panels D and F). However, their response to LPA differed 
dramatically (see Fig. 9, panels E and G). Whereas application of 100 nmol/1 LPA led to rapid 
neurite retraction in embryonic entorhinal explants (El 6; n = 20, compare panel D to panel 
E), postnatal explants (PO; n = 22, compare panel F to panel G) did not differ significantly 
10 from vehicle treated control cultures. Thus, postnatal entorhinal axons expressing PRG-1 are 
resistant to LPA-induced neurite retraction. The amount of retraction observed in panels D-G 
is quantified in Fig. 10, panel A. The dose response is shown in Fig. 10, panel B. 



10. Differential effect of PRG-1 and PRG-1 mutant on LPA-induced neurite retraction 

15 

Site-directed mutagenesis of PRG-1 HIS/LYS 

The rat PRG-1 full length clone was amplified by Marathon PCR (Clontech, USA) from adult 
rat hippocampus RNA (SEQ ID No. 10). For transfection studies, the full length PRG-1 cod- 
ing sequence was fused to EGFP (pEGFP-Nl vector Clontech, USA). The PRG-1 His/Lys ex- 
20 change mutant at the catalytic histidine (His-252) was introduced in the same protein fusion 
vector by site specific mutagenesis (CAT to AAG). 



LPA induced neurite retraction and protection in N1E-1 15 cells 

N1E-115 mouse neuroblastoma cells (ATCC: CRL-2263) were routinely grown in DMEM 
25 medium supplemented with selenium-defined fetal bovine serum (10%). The cells were 
seeded on baked glass cover slides at a density of 10,000 cells/cm 2 . The next day, cells were 
transfected with the cationic lipids procedure (FuGene6, Roche, Germany) and cultivated for 
24 h. Serum-starvation was performed for 20 h in DMEM medium, followed by treatment 
with 10 |imol/l oleoyl-LPA or vehicle (0.9% sodium chloride) (K. Jalink et al (1993) Cell 
30 Growth Differ. 4:247-255). After 10 min, cells were fixed in 4% paraformaldehyde for 20 
min at room temperature and further analysis processed as described above. Images were 
taken with a CCD camera on an Olympus BX-50 microscope. Quantification was performed 
with the Meta Morph analysis system (University Imaging, PA). For statistical analysis Stat- 
view II was used (Abacus, USA). The effects of transfection on cell viability were analysed 
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by MTT assay and propidium iodide staining (Savaskan N.E. et al. (2002) FASEB J. 17:1 12- 
114). 

N1E-1 15 cells are uniformly sensitive to LPA-induced growth cone collapse (see Fig. 1 1 and 
5 compare first row of panels versus second row of panels). This is also confirmed by phalloi- 
din staining (K. Jalink et al, (1994) J. Cell Biol. 126:801-810) which showed actin polymeri- 
zation upon PRG-1 overexpression, however, led to a resistance of N1E-115 cells to LPA- 
induced growth cone collapse and also prevented LPA-induced actin-polymerization (see Fig. 
1 1 , third row of panels). The mutation of the conserved catalytic histidine (His-252) to a ly- 

10 sine (PRG-1 His/Lys ) a change which has been shown to completely abolish enzymatic function 
of the catalytic center of LPP-1 (N. Zhang, et al. (1997) supra) no longer prevented LPA- 
induced retraction of processes as achieved by the wt-construct This shows that the con- 
served enzymatic domain of the LPP-1 family is necessary for PRG-1 function in attenuating 
LPA-induced neurite retraction. The results of the experiments are quantified in Fig. 12, panel 

15 A. The transfection, however, had no effect on the expression of the LPA-receptors EDG- 
2/4//7 (see Fig. 12, panel B). The control for the integrity of the RNA tested by RT-PCR is 
shown in Fig. 12, panel C The resistance against CPA-induced growth cone collapse 
achieved by PRG-1 overexpression could only be overcome by a 10-fold increase of LP A 
applied to the culture (Fig. 12, panel D). 

20 

11. Ectophosphataseactivitv in PRG-1 transfected N1H-1 15 

The assay procedures used were adapted from those described in Imai et al. (2000) J. Clin. 
Endocrinol. Metab 85:3370-3375 and Hooks S.B. et al. (2001) J. Biol. Chem. 276:4611-4621. 
Briefly, transfected cells were harvested after serum-free cultivation for 20 h by scraping in 

25 lysis buffer (containing 20 mmol/1 HEPES pH 7.4, 1 mmol/1 NaHC0 3 , 500 junol/l DTT, 1 
mmol/1 EGTA) and sonicated with two strokes (Bandolin GM 70, Germany). After centrifu- 
gation at 800 x g, the supernatant was diluted in lysis buffer and centrifiiged at 100,000 x g 
for 1 h. Alternatively, the supernatant was topped on a sucrose step gradient (50% and 5%) 
and centrifuged at 100,000 x g for 1 h (Savaskan, N.E. et al. (2000) Eur. J. Neurosci. 12:1024- 

30 1032. The resulting pellet and interphase, respectively, were re-homogenized and centrifuged 
again at 100,000 x g for 1 h. The crude membrane pellet was re-suspended in reaction buffer 
(50 mmol/1 HEPES pH 7,5; 1 mmol/1 EGTA) and protein concentration was determined 
spectroscopically by Bradford protein assay (Amersham, Germany). Assessment of LPA me- 
tabolism was performed with exogenous 3 H-oleoeyl-LPA (Perkin Elmer, Germany) measur- 
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ing H-oleoeyl-glycerol production. Briefly, 10-25 |ig membrane proteins were pre-warmed to 
37°C in reaction buffer and reactions were started by the addition of 10 nmol/1 3 H-oleoeyl- 
LPA. The reaction was allowed to proceed for 5-30 min at 37°C and stopped with the addition 
of 2.5 vol acidified methanol and 1.5 vol chloroform. After two-phase separation, the chloro- 
5 form phase was dried under N2 and applied to silica gel matrices (Machery Nagel, Germany). 
Plates were developed in chloroform-aceton-acetic acid (90:10:1). All fractions of dried plates 
were scintillation counted and compared with authentic standards (LP A, 1-monoolein, 1,2- 
monoolein, oleic acid). All experiments were performed in triplicate. The ecto-phosphatase 
assay in intact cells was determined as described above. 3 H-LPA was added to the serum-free 
10 medium and the reaction was allowed to proceed for 5-60 min. The reaction was stopped by 
adding 2.5 vol acidified methanol and 1.5 vol chloroform to the supernatant. The radioactivity 
was determined in the lipid fraction as described above. 

The results from three independent sets of experiments are shown in Fig. 12, panel F (one set 
15 with n = 40 for each group in the outgrowth assay). PRG-l-eGFP showed essentially the same 
ecto-phosphatase activity as the wt PRG-1 construct. Statistical differences from controls are 
marked with an asterisk (mean ± S.D.), ***P < 0.001; two-tailed t test with Bonferroni cor- 
rection for multiple comparisons. 

20 The extracellular LPA-degradation achieved by transfected N1E-115 cells revealed a 5-fold 
increase in ecto-phosphatase activity of wildetype PRG-1 transfectants when compared to 
eGFP transfection alone (Fig. 12, panel F). Moreover, point-mutation in the conserved cata- 
lytic domain by His/Lys exchange led to a 95 % decrease of ecto-phosphatase activity (Fig. 
12, panel F). These findings show that PRG-1 has ecto-phosphatase activity which is con- 

25 veyed by a conserved enzymatic domain present in the LPP-1 family, and necessary for at- 
tenuating LPA-induced neurite retraction. In addition transfection of the PRG-1 cnstruct into 
N1E-115 cells did not prevent CPA-induced retraction of processes as achieved by the wt- 
constructs, whereas cell viability was unaffected (see Fig. 12, panel E). 

30 Thus the ecto-phosphatase activity of PRG-1 was shown by two independent experiments: a) 
transfection of a mutant construct into N1E-1 15 cells did not prevent LPA-induced retraction 
of processes as achieved by transfection of wt construct and b) the transfection of the wt con- 
struct led to a 5-fold increase in ecto-phosphatase activity, if compared to cells transfected 
with a control vector. 
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12. Model of axon growth mechanism 
5 Fig. 13 shows a diagram of the proposed axon growth mechanism in a phospholipid rich envi- 
ronment. Axons that are sensitive to repulsive phosphor lipid but do not express PRG-1 are 
unable to cross a phosphor lipid-rich barrier. In contrasts PRG-1 expressing neurons can grow 
through a phosphor lipid rich zone by local depleting the extracellular pool of repulsive phos- 
phor lipids acting as ligands on EDG-receptors. This way, PRG-1 may regulate the activation 
10 of EDG-receptors and thereby modulate axonal outgrowth. 
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20 25 30 
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Tyr 
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Cys Cys 



Leu 



Ser Lys Arg Arg Asn Gly Val Gly Leu Glu Pro Asn 
150 155 160 



lie Asn Ala Gly Gly Cys Asn Phe Asn Ser Phe Leu Arg Arg Ala 
165 *• ~* ~ 



170 



Val 



175 



Arg Phe Val Gly Val His Val Phe Gly Leu Cys Ser Thr Ala Leu He 
180 185 190 

Thr Asp lie He Gin Leu Ser Thr Gly Tyr Gin Ala Pro Tyr Phe Leu 
195 200 205 



Thr Val Cys Lys Pro Asn Tyr Thr Ser Leu Asn Val Ser Cys Lys Glu 



215 



220 



Asn Ser Tyr He Val Glu Asp He Cys Ser Gly Ser Asp Leu Thr Val 



230 



235 



240 



He Asn Ser Gly Arg Lys Ser Phe Pro Ser Gin His Ala 
245 250 



Thr Leu Ala 
255 



Ala Phe Ala Ala Val Tyr Val Ser Met Tyr Phe Asn Ser Thr Leu Thr 
260 265 270 

Asp Ser Ser Lys Leu Leu Lys Pro Leu Leu Val Phe Thr Phe He He 
275 280 285 



Cys Gly He He Cys Gly Leu Thr Arg He Thr Gin Tyr Lys Asn Hi, 



295 



300 



Pro Val Asp Val Tyr Cys Gly Phe Leu He Gly Gly Gly He Ala Leu 



310 



315 



320 



Tyr Leu Gly Leu Tyr Ala Val Gly Asn Phe Leu Pro Ser Asp Glu 



330 



Ser 



335 



Met Phe Gin His Arg Asp Ala Leu Arg Ser Leu Thr Asp Leu Asn Gin 
340 345 350 



Asp Pro Asn Arg Leu Leu Ser Ala Lys Asn Gly Ser Ser Ser Asp Gly 
J55 360 1 



365 



He Ala His Thr Glu Gly lie Leu Asn Arg Asn His Arg Asp Ala Ser 
" J 375 380 



Ser Leu Thr Asn Leu Lys Arg Ala Asn Ala Asp Val Glu He He Thr 

395 400 



Pro Arg Ser Pro Met Gly Lys Glu Asn Met Val Thr Phe Ser Asn 



405 



410 



Thr 



415 



3 



Leu Pro Arg Ala Asn Thr Pro Ser Val Glu Asp Pro Val Arg Arg Asn 
420 425 430 



Ala Ser lie His Ala Ser Met Asp Ser Ala Arg Ser Lys Gin Leu Leu 
435 440 445 



Thr Gin Trp Lys Asn Lys Asn Glu Ser Arg Lys Leu Ser Leu Gin Val 
450 455 460 



lie Glu Pro Glu Pro Gly Gin Ser Pro Pro Arg Ser lie Glu Met Arg 
465 470 475 480 



Ser Ser Ser Glu Pro Ser Arg Val Gly Val Asn Gly Asp His His Gly 
4 85 4 90 ~ 4 95 



Pro Gly Asn Gin Tyr Leu Lys lie Gin Pro Gly Ala Val Pro Gly Cys 
500 505 510 



Asn Asn Ser Met Pro Gly Gly Pro Arg Val Ser lie Gin Ser Arg Pro 
515 520 525 



Gly Ser Ser Gin Leu Val His lie Pro Glu Glu Thr Gin Glu Asn lie 
530 535 540 



Ser Thr Ser Pro Lys Ser Ser Ser Ala Arg Ala Lys Trp Leu Lys Ala 
545 550 555 ^ 560 

Ala Glu Lys Thr Val Ala Cys Asn Arg Ser Asn Ser Gin Pro Arg lie 
565 570 575 



Met Gin Val lie Ala Met Ser Lys Gin Gin Gly Val Leu Gin Ser Ser 
580 585 590 



Pro Lys Asn Thr Glu Gly Ser Thr Val Ser Cys Thr Gly Ser lie Arg 
595 600 605 



Tyr Lys Thr Leu Thr Asp His Glu Pro Ser Gly lie Val Arg Val Glu 
610 615 * 620 



Ala His Pro Glu Asn Asn Arg Pro lie lie Gin lie Pro Ser Thr Glu 
625 630 635 640 

Gly Glu Gly Ser Gly Ser Trp Lys Trp Lys Ala Pro Glu Lys Gly Ser 
645 650 655 

Leu Arg Gin Thr Tyr Glu Leu Asn Asp Leu Asn Arg Asp Ser Glu Ser 
660 665 670 



4 



Cys Glu Ser Leu Lys Asp Ser Phe Gly Ser Gly Asp Arg Lys Arg Ser 
67 5 680 685 



Asn He Asp Ser Asn Glu His His His His Gly He Thr Thr He Arg 
690 695 700 



Val Thr Pro Val Glu Gly Ser Glu He Gly Ser Glu Thr Leu Ser He 
705 710 715 720 

Ser Ser Ser Arg Asp Ser Thr Leu Arg Arg Lys Gly Asn He He Leu 
725 730 735 

He Pro Glu Arg Ser Asn Ser Pro Glu Asn Thr Arg Asn He Phe Tyr 
740 745 750 

Lys Gly Thr Ser Pro Thr Arg Ala Tyr Lys Asp 
755 760 



<210> 2 

<211> 746 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met He Ser Thr Lys Glu Lys Asn Lys He Pro Lys Asp Ser Met Thr 
1 5 10 15 

Leu Leu Pro Cys Phe Tyr Phe Val Glu Leu Pro He Val Ala Ser Ser 
20 25 30 

He Val Ser Leu Tyr Phe Leu Glu Leu Thr Asp Leu Phe Lys Pro Ala 
35 40 45 

Lys Val Gly Phe Gin Cys Tyr Asp Arg Thr Leu Ser Met Pro Tyr Val 
50 55 60 

Glu Thr Asn Glu Glu Leu He Pro Leu Leu Met Leu Leu Ser Leu Ala 
65 70 75 ^ 80 

Phe Ala Ala Pro Ala Ala Ser He Met Val Ala Glu Gly Met Leu Tyr 
85 90 95 

Cys Leu Gin Ser Arg Leu Trp Gly Arg Ala Gly Gly Pro Ala Gly Ala 
100 105 HO 

Glu Gly Ser He Asn Ala Gly Gly Cys Asn Phe Asn Ser Phe Leu Arg 
H5 120 125 



5 



Arg Thr Val Arg Phe Val Gly Val His Val Phe Gly Leu Cys Ala Thr 
130 135 140 



Ala Leu Val Thr Asp Val lie Gin Leu Ala Thr Gly Tyr His Thr Pro 
145 150 155 160 



Phe Phe Leu Thr Val Cys Lys Pro Asn Tyr Thr Leu Leu Gly Thr Ser 
165 170 175 



Cys Glu Val Asn Pro Tyr lie Thr Gin Asp lie Cys Ser Gly His Asp 
180 185 " 190 



lie His Ala lie Leu Ser Ala Arg Lys Thr Phe Pro Ser Gin His Ala 
195 200 205 



Thr Leu Ser Ala Phe Ala Ala Val Tyr Val Ser Val Ser Pro Ala Pro 
210 215 220 



His Cys Pro Ser Gin Ala Leu Leu Leu Thr Arg Gly Glu Pro Ser Leu 

225 230 235 240 

Thr Pro Thr Pro Met Pro Gin Met Tyr Phe Asn Ser Val lie Ser Asp 

245 250 255 



Thr Thr Lys Leu Leu Lys Pro lie Leu Val Phe Ala Phe Ala lie Ala 
260 265 270 



Ala Gly Val Cys Gly Leu Thr Gin lie Thr Gin Tyr Arg Ser His Pro 
275 280 285 



Val Asp Val Tyr Ala Gly Phe Leu lie Gly Ala Gly He Ala Ala Tyr 
290 295 300 



Leu Ala Cys His Ala Val Gly Asn Phe Gin Ala Pro Pro Ala Glu Lys 
305 310 315 320 

Pro Ala Ala Pro Ala Pro Ala Lys Asp Ala Leu Arg Ala Leu Thr Gin 
325 330 ~ 335 

Arg Gly His Asp Ser Val Tyr Gin Gin Asn Lys Ser Val Ser Thr Asp 
340 345 350 



Glu Leu Gly Pro Pro Gly Arg Leu Glu Gly Ala Pro Arg Pro Val Ala 
355 360 365 



Arg Glu Lys Thr Ser Leu Gly Ser Leu Lys Arg Ala Ser Val Asp Val 



370 



375 



380 



Asp Leu Leu Ala Pro Arg Ser Pro Met Ala Lys Glu Asn Met Val Thr 
385 390 395 400 

Phe Ser His Thr Leu Pro Arg Ala Ser Ala Pro Ser Leu Asp Asp Pro 
405 410 415 

Ala Arg Arg His Met Thr lie His Val Pro Leu Asp Ala Ser Arq Ser 
420 425 430 

Lys Gin Leu He Ser Glu Trp Lys Gin Lys Ser Leu Glu Gly Arq Glv 
435 440 445 

Leu Gly Leu Pro Asp Asp Ala Ser Pro Gly His Leu Arg Ala Pro Ala 
4 50 4 55 4 60 " ~ 

Glu Pro Met Ala Glu Glu Glu Glu Glu Glu Glu Asp Glu Glu Glu Glu 
465 470 475 480 

Glu Glu Glu Glu Glu Glu Glu Asp Glu Gly Pro Ala Pro Pro Ser Leu 
485 490 495 

Tyr Pro Thr Val Gin Ala Arg Pro Gly Leu Gly Pro Arg Val He Leu 
500 505 51.0 



Pro Pro Arg Ala Gly Pro Pro Pro Leu Val His He Pro Glu Glu 



515 



520 



525 



Gly 



Ala Gin Thr Gly Ala Gly Leu Ser Pro Lys Ser Gly Ala Gly Val Arg 
530 535 540 

Ala Lys Trp Leu Met Met Ala Glu Lys Ser Gly Ala Ala Val Ala Asn 

545 550 555 560 



Pro Pro Arg Leu Leu Gin Val He Ala Met Ser 



565 



570 



Lys Ala Pro Gly Ala 
575 



Pro Gly Pro Lys Ala Ala Glu Thr Ala Ser Ser Ser Ser Ala Ser Ser 
580 585 590 

Asp Ser Ser Gin Tyr Arg Ser Pro Ser Asp Arg Asp Ser Ala Ser He 
595 600 605 



Val Thr He Asp Ala His Ala Pro His His Pro Val Val His Leu Ser 
610 615 620 



7 



Ala Gly Gly Ala Pro Trp Glu Trp Lys Ala Ala Gly Gly Gly Ala Lys 
625 630 635 " 640 



Ala Glu Ala Asp Gly Gly Tyr Glu Leu Gly Asp Leu Ala Arg Gly Phe 
645 650 655 



Arg Gly Gly Ala Lys Pro Pro Gly Val Ser Pro Gly Ser Ser Val Ser 
660 665 670 



Asp Val Asp Gin Glu Glu Pro Arg Phe Gly Ala Val Ala Thr Val Asn 
675 680 685 



Leu Ala Thr Gly Glu Gly Leu Pro Pro Leu Gly Ala Ala Asp Gly Ala 
690 695 ~ 700 



Leu Gly Pro Gly Ser Arg Glu Ser Thr Leu Arg Arg His Ala Gly Gly 
705 710 715 720 



Leu Gly Leu Ala Glu Arg Glu Ala Glu Ala Glu Ala Glu Gly Tyr Phe 
725 730 735 



Arg Lys Met Gin Ala Arg Arg Phe Pro Asp 
740 745 



<210> 3 

<211> 766 

<212> PRT 

<213> Mus musculus 

<400> 3 

Met Gin Arg Ala Gly Ser Ser Gly Ala Arg Gly Glu Cys Asp lie Ser 
1 5 10 15 



Gly Ala Gly Arg Leu Arg Leu Glu Gin Ala Ala Arg Leu Gly Gly Arg 
20 25 30 



Thr Val His Thr Ser Pro Gly Gly Gly Leu Gly Ala Arg Gin Ala Ala 
35 40 45 



Gly Met Ser Ala Lys Glu Arg Pro Lys Gly Lys Val lie Lys Asp Ser 
50 55 60 



Val Thr Leu Leu Pro Cys Phe Tyr Phe Val Glu Leu Pro lie Leu Ala 
65 70 75 80 



Ser Ser Val Val Ser Leu Tyr Phe Leu Glu Leu Thr Asp Val Phe Lys 
85 90 95 



8 



Pro Val His Ser Gly Phe Ser Cys Tyr Asp Arg Ser Leu Ser Met Pro 
100 105 110 



Tyr lie Glu Pro Thr Gin Glu Ala lie Pro Phe Leu Met Leu Leu Ser 
115 120 125 



Leu Ala Phe Ala Gly Pro Ala lie Thr lie Met Val Gly Glu Gly He 
130 135 140 



Leu Tyr Cys Cys Leu Ser Lys Arg Arg Asn Gly Ala Gly Leu Glu Pro 
145 150 155 160 



Asn He Asn Ala Gly Gly Cys Asn Phe Asn Ser Phe Leu Arg Arg Ala 
165 170 175 



Val Arg Phe Val Gly Val His Val Phe Gly Leu Cys Ser Thr Ala Leu 
180 185 190 



He Thr Asp He He Gin Leu Ser Thr Gly Tyr Gin Ala Pro Tyr Phe 
195 200 205 



Leu Thr Val Cys Lys Pro Asn Tyr Thr Ser Leu Asn Val Ser Cys Lys 
210 215 220 



Glu Asn Ser Tyr lie Val Glu Asp lie Cys Ser Gly Ser Asp Leu Thr 
225 230 235 240 



Val lie Asn Ser Gly Arg Lys Ser Phe Pro Ser Gin His Ala Thr Leu 
245 250 255 



Ala Ala Phe Ala Ala Val Tyr Val Ser Met Tyr Phe Asn Ser Thr Leu 
260 265 270 



Thr Asp Ser Ser Lys Leu Leu Lys Pro Leu Leu Val Phe Thr Phe lie 
275 280 285 



lie Cys Gly lie lie Cys Gly Leu Thr Arg lie Thr Gin Tyr Lys Asn 
290 295 300 



His Pro Val Asp Val Tyr Cys Gly Phe Leu He Gly Gly Gly lie Ala 
305 310 315 320 



Leu Tyr Leu Gly Leu Tyr Ala Val Gly Asn Phe Leu Pro Ser Glu Asp 
325 330 335 



Ser Met Leu Gin His Arg Asp Ala Leu Arg Ser Leu Thr Asp Leu Asn 
340 345 350 



9 



Gin Asp Pro Ser Arg Val Leu Ser Ala Lys Asn Gly Ser Ser Gly Asp 
355 360 365 



Gly He Ala His Thr Glu Gly He Leu Asn Arg Asn His Arg Asp Ala 
370 375 380 



Ser Ser Leu Thr Asn Leu Lys Arg Ala Asn Ala Asp Val Glu He He 
385 390 395 ~ 400 

Thr Pro Arg Ser Pro Met Gly Lys Glu Ser Met Val Thr Phe Ser Asn 
405 410 415 

Thr Leu Pro Arg Ala Asn Thr Pro Ser Val Glu Asp Pro Val Arg Arg 

420 425 430 



Asn Ala Ser He His Ala Ser Met Asp Ser Ala Arg Ser Lys Gin Leu 
435 440 445 



Leu Thr Gin Trp Lys Ser Lys Asn Glu Ser Arg Lys Met Ser Leu Gin 
450 455 460 



Val Met Asp Thr Glu Pro Glu Gly Gin Ser Pro Pro Arg Ser He Glu 

465 470 475 480 

Met Arg Ser Ser Ser Glu Pro Ser Arg Val Gly Val Asn Gly Asp His 

485 490 495 



His Val Pro Gly Asn Gin Tyr Leu Lys He Gin Pro Gly Thr Val Pro 
500 505 510 



Gly Cys Asn Asn Ser Met Pro Gly Gly Pro Arg Val Ser He Gin Ser 
515 520 525 



Arg Pro Gly Ser Ser Gin Leu Val His He Pro Glu Glu Thr Gin Glu 
530 535 540 



Asn lie Ser Thr Ser Pro Lys Ser Ser Ser Ala Arg Ala Lys Trp Leu 

545 550 555 " 560 

Lys Ala Ala Glu Lys Thr Val Asp Cys Asn Arg Ser Asn Asn Gin Pro 

565 570 575 

Arg He Met Gin Val He Ala Met Ser Lys Gin Gin Gly Val Leu Gin 

580 585 590 

Ser Ser Pro Lys Asn Ala Glu Gly Ser Thr Val Thr Cys Thr Gly Ser 

595 600 605 
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He Arg Tyr Lys Thr Leu Thr Asp His Glu Pro Ser Gly He Val Arq 
610 615 620 

Val Glu Ala His Pro Glu Asn Asn Arg Pro He He Gin He Pro Ser 
625 630 635 640 

Ser Thr Glu Gly Glu Gly Ser Gly Ser Trp Lys Trp Lys Val Pro Glu 
645 650 655 

Lys Ser Ser Leu Arg Gin Thr Tyr Glu Leu Asn Asp Leu Asn Arg Asp 
660 665 670 

Ser Glu Ser Cys Glu Ser Leu Lys Asp Ser Phe Gly Ser Gly Asp Arg 
675 680 685 

Lys Arg Ser Asn He Asp Ser Asn Glu His His His His Gly He Thr 
690 695 700 

Thr He Arg Val Thr Pro Val Glu Gly Ser Glu lie Gly Ser Glu Thr 
705 710 715 720 

Leu Ser Val Ser Ser Ser Arg Asp Ser Thr Leu Arg Arg Lys Glv Asn 
725 730 735 

He He Leu He Pro Glu Arg Ser Asn Ser Pro Glu Asn Thr Arg Asn 
740 745 750 

He Phe Tyr Lys Gly Thr Ser Pro Thr Arg Ala Tyr Lys Asp 
755 760 765 

<210> 4 

<211> 716 

<212> PRT 

<213> Mus musculus 

<400> 4 

Met Leu Ala Met Lys Glu Lys Asn Lys Thr Pro Lys Asp Ser Met Thr 
1 5 10 15 

Leu Leu Pro Cys Phe Tyr Phe Val Glu Leu Pro He Val Ala Ser Ser 
20 25 30 

He Val Ser Leu Tyr Phe Leu Glu Leu Thr Asp Leu Phe Lys Pro Ala 
35 40 45 



Lys Val Gly Phe Gin Cys Tyr Asp Arg Ala Leu Ser Met Pro Tyr Val 
50 55 60 



11 



Glu Thr Asn Glu Glu Leu He Pro Leu Leu Met Leu Leu Ser Leu Ala 

65 70 75 80 

Phe Ala Ala Pro Ala Ala Ser He Met Val Gly Glu Gly Met Val Tyr 
85 90 ' 95 

Cys Leu Gin Ser Arg Leu Trp Gly Arg Gly Pro Gly Gly Val Glu Gly 
100 105 J no 

Ser He Asn Ala Gly Gly Cys Asn Phe Asn Ser Phe Leu Arg Arg Thr 
115 120 125 



Val Arg Phe Val Gly Val His Val Phe Gly Leu Cys Ala Thr Ala Leu 
130 135 140 



Val Thr Asp Val He Gin Leu Ala Thr Gly Tyr His Thr Pro Phe Phe 
145 150 155 160 

Leu Thr Val Cys Lys Pro Asn Tyr Thr Leu Leu Gly Thr Ser Cys Glu 
165 170 175 

Ser Asn Pro Tyr He Thr Gin Asp He Cys Ser Gly His Asp Thr His 
180 185 190 

Ala He Leu Ser Ala Arg Lys Thr Phe Pro Ser Gin His Ala Thr Leu 
195 200 205 

Ser Ala Phe Ala Ala Val Tyr Val Ser Met Tyr Phe Asn Ala Val He 
210 215 220 



Ser Asp Thr Thr Lys Leu Leu Lys Pro He Leu Val Phe Ala Phe Ala 
225 230 235 240 

He Ala Ala Gly Val Cys Gly Leu Thr Gin He Thr Gin Tyr Arg Ser 
245 250 255 

His Pro Val Asp Val Tyr Ala Gly Phe Leu He Gly Ala Gly He Ala 
260 265 270 

Ala Tyr Leu Ala Cys His Ala Val Gly Asn Phe Gin Ala Pro Pro Ala 
275 280 285 



Glu Lys Val Pro Thr Pro Ala Pro Ala Lys Asp Ala Leu Arg Ala Leu 
290 295 300 



Thr Gin Arg Gly His Glu Ser Met Tyr Gin Gin Asn Lys Ser Val Ser 
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305 310 315 320 



Thr Asp Glu Leu Gly Pro Pro Gly Arg Leu Glu Gly Val Pro Arg Pro 
325 330 335 



Val Ala Arg Glu Lys Thr Ser Leu Gly Ser Leu Lys Arg Ala Ser Val 
340 ~ 345 350 



Asp Val Asp Leu Leu Ala Pro Arg Ser Pro Met Gly Lys Glu Gly Met 
355 360 365 



Val Thr Phe Ser Asn Thr Leu Pro Arg Val Ser Thr Pro Ser Leu Asp 
370 375 380 



Asp Pro Ala Arg Arg His Met Thr lie His Val Pro Leu Asp Ala Ser 
385 390 395 400 



Arg Ser Arg Gin Leu lie Gly Glu Trp Lys Gin Lys Ser Leu Glu Gly 
405 410 415 



Arg Gly Leu Gly Leu Pro Asp Glu Ala Ser Pro Val His Leu Arg Ala 
420 425 430 



Pro Ala Glu Gin Val Ala Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu 
435 440 445 



Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Gly Pro Val Pro Pro Ser 
450 455 460 



Leu Tyr Pro Thr Val Gin Ala Arg Pro Gly Leu Gly Pro Arg Val lie 
465 470 475 480 



Leu Pro Pro Arg Pro Gly Pro Gin Pro Leu Val His lie Pro Glu Glu 
485 490 495 



Gly Val Gin Ala Gly Ala Gly Leu Ser Pro Lys Ser Ser Ser Ser Ser 
500 505 510 



Val Arg Ala Lys Trp Leu Ser Val Ala Glu Lys Gly Gly Gly Pro Val 
515 520 525 



Ala Val Ala Pro Ser Gin Pro Arg Val Ala Asn Pro Pro Arg Leu Leu 
530 535 540 



Gin Val lie Ala Met Ser Lys Ala Ala Gly Gly Pro Lys Ala Glu Thr 
545 550 555 560 
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Ala Ser Ser Ser Ser Ala Ser Ser Asp Ser Ser Gin Tyr Arg Ser Pro 
565 570 575 



Ser Asp Arg Asp Ser Ala Ser lie Val Thr lie Asp Ala His Ala Pro 
580 585 590 



His His Pro Val Val His Leu Ser Ala Gly Ser Thr Pro Trp Glu Trp 
595 600 605 



Lys Ala Lys Val Val Glu Gly Glu Gly Ser Tyr Glu Leu Gly Asp Leu 
610 615 620 



Ala Arg Gly Phe Arg Ser Ser Cys Lys Gin Pro Gly Met Gly Pro Gly 
625 630 635 ~ 640 



Ser Pro Val Ser Asp Val Asp Gin Glu Glu Pro Arg Phe Gly Ala Val 
645 650 655 



Ala Thr Val Asn Leu Ala Thr Gly Glu Gly Leu Pro Pro Pro Gly Ala 
660 665 670 



Ser Glu Gly Ala Leu Gly Ala Gly Ser Arg Glu Ser Thr Leu Arg Arg 
675 680 685 



Gin Val Gly Gly Leu Ala Glu Arg Glu Val Glu Ala Glu Ala Glu Ser 
690 695 700 



Tyr Tyr Arg Arg Met Gin Ala Arg Arg Tyr Gin Asp 
705 710 " 715 



<210> 5 

<211> 766 

<212> PRT 

<213> Rattus norvegicus 

<400> 5 

Met Gin Arg Ala Gly Ser Ser Gly Ala Arg Gly Glu Cys Asp lie Ser 
1 5 10 ^ 15 

Gly Thr Gly Arg Leu Arg Leu Glu Gin Ala Ala Arg Leu Gly Gly Arg 
20 25 " 30 

Ala Val His Thr Ser Pro Thr Gly Gly Leu Gly Ala Arg Gin Val Ala 
35 40 45 



Gly Met Ser Ala Lys Glu Arg Pro Lys Gly Lys Val lie Lys Asp Ser 
50 55 60 
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Val Thr Leu Leu Pro Cys Phe Tyr Phe Val Glu Leu Pro lie Leu Ala 
65 70 75 80 



Ser Ser Val Val Ser Leu Tyr Phe Leu Glu Leu Thr Asp Val Phe Lys 
85 90 95 



Pro Val His Ser Gly Phe Ser Cys Tyr Asp Arg Ser Leu Ser Met Pro 
100 105 110 



Tyr lie Glu Pro Thr Gin Glu Ala lie Pro Phe Leu Met Leu Leu Ser 
115 120 125 

Leu Ala Phe Ala Gly Pro Ala He Thr He Met Val Gly Glu Gly He 
130 135 140 

Leu Tyr Cys Cys Leu Ser Lys Arg Arg Asn Gly Ala Gly Leu Glu Pro 
145 150 155 160 

Asn lie Asn Ala Gly Gly Cys Asn Phe Asn Ser Phe Leu Arg Arg Ala 
165 170 175 



Val Arg Phe Val Gly Val His Val Val Gly Leu Cys Ser Thr Ala Leu 
180 " 185 190 



He Thr Asp He He Gin Leu Ala Thr Gly Tyr Gin Ala Pro Tyr Phe 
195 200 205 

Leu Thr Val Cys Lys Pro Met Tyr Thr Ser Leu Glu Gly Ser Cys Lys 
210 " 215 220 

Glu Asn Ser Tyr He Val Glu Glu He Cys Ser Gly Ser Asp Leu Thr 
225 230 235 240 



Val He Asn Asn Gly Lys Lys Ser Phe Pro Ser Gin His Ala Thr Leu 
245 250 255 



Ala Ala Phe Ala Ala Val Tyr Val Ser Met Tyr Phe Asn Ser Thr Leu 
260 265 270 



Thr Asp Ser Ser Lys Leu Leu Lys Pro Leu Leu Val Phe Thr Phe He 
275 " 280 285 



He Cys Gly He He Cys Gly Leu Thr Arg He Thr Gin Tyr Lys Asn 
290 295 300 



His Pro Val Asp Val Tyr Cys Gly Phe Leu He Gly Gly Gly He Ala 
305 310 315 320 
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Leu Tyr Leu Gly Leu Tyr Ala Val Gly Asn Phe Leu Pro Ser Glu Asp 
325 330 335 



Ser Met Leu Gin His Arg Asp Ala Leu Arg Ser Leu Thr Asp Leu Asn 
340 345 " 350 



Gin Asp Pro Ser Arg Val Leu Ser Ala Lys Asn Gly Ser Ser Gly Asp 
355 360 365 



Gly lie Ala His Thr Glu Gly lie Leu Asn Arg Asn His Arg Asp Ala 
370 375 380 



Ser Ser Leu Thr Asn Leu Lys Arg Ala Asn Ala Asp Val Glu lie lie 
385 390 395 400 



Thr Pro Arg Ser Pro Met Gly Lys Glu Ser Met Val Thr Phe Ser Asn 
405 410 415 



Thr Leu Pro Arg Ala Asn Thr Pro Ser Val Glu Asp Pro Val Arg Arg 
4 20 425 4 30 



Asn Ala Ser lie His Ala Ser Met Asp Ser Ala Arg Ser Lys Gin Leu 
4 35 440 445 



Leu Thr Gin Trp Lys Ser Lys Asn Glu Ser Arg tys Met Ser Leu Gin 
450 455 460 



Val Met Asp Ser Glu Pro Glu Gly Gin Ser Pro Pro Arg Ser lie Glu 
465 470 475 " 480 



Met Arg Ser Ser Ser Glu Pro Ser Arg Val Gly Val Asn Gly Asp His 
485 490 495 



His Val Pro Gly Asn Gin Tyr Leu Lys lie Gin Pro Gly Thr Val Pro 
500 505 510 



Gly Cys Asn Asn Ser Met Pro Ala Gly Pro Arg Val Ser lie Gin Ser 
515 520 525 



Arg Pro Gly Ser Ser Gin Leu Val His lie Pro Glu Glu Thr Gin Glu 
530 535 540 



Asn lie Ser Thr Ser Pro Lys Ser Ser Ser Ala Arg Ala Lys Trp Leu 

545 550 555 560 

Lys Ala Ala Glu Lys Thr Val Ala Cys Asn Arg Gly Asn Asn Gin Pro 

565 570 575 
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Arg lie Met Gin Val lie Ala Met Ser Lys Gin Gin Gly Val Leu Gin 
580 585 590 

Ser Ser Pro Lys Asn Ala Glu Gly Ser Thr Val Thr Cys Thr Gly Ser 
595 600 605 

lie Arg Tyr Lys Thr Leu Thr Asp His Glu Pro Ser Gly lie Val Arg 
610 615 620 

Val Glu Ala His Pro Glu Asn Asn Arg Pro lie lie Gin lie Pro Ser 
625 630 635 640 

Ser Thr Glu Gly Glu Gly Ser Gly Ser Trp Lys Trp Lys Ala Pro Glu 
645 650 655 

Lys Ser Ser Leu Arg Gin Thr Tyr Glu Leu Asn Asp Leu Asn Arg Asp 
660 665 670 

Ser Glu Ser Cys Glu Ser Leu Lys Asp Ser Phe Gly Ser Gly Asp Arg 
675 680 685 

Lys Arg Lys His lie Asp Ser Asn Glu His His His His Gly lie Thr 
690 695 700 

Thr lie Arg Val Thr Pro Val Glu Gly Ser Glu He Gly Ser Glu Thr 
705 710 715 720 

Leu Ser Val Ser Ser Ser Arg Asp Ser Thr Leu Arg Arg Lys Gly Asn 
725 730 735 

He He Leu He Pro Glu Arg Ser Asn Ser Pro Glu Asn Thr Arg Asn 
740 745 750 

He Phe Tyr Lys Gly Thr Ser Pro Thr Arg Pro Tyr Lys Asp 
755 760 765 

<210> 6 

<211> 5083 

<212> DNA 

<213> Homo sapiens 

<400> 6 

ggatccacta gtaacggccg ccagtgtgct ggaattcgcc cttgaagcca ttgcagcaac 60 

agcttggagg agggagctgg acgtcgtctc tcgccagaaa aacggggagc aggagccaga 120 

ctaaaggagg aagaggactg gcccgctcag ggaatagctg ggttgctgca aaaaggggcg 180 

gggagaaggc gggggcgctg catgcagcgc gctggctcca gcggtggccg cggggaatgt 240 
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gacatcagcg 


gcgccgggcg 


cttggggctg 


gaggaggcgg 


ctcgcctcag 


ctgcgctgtg 


300 


cacacctcgc 


ccgggggagg 


acgcagaccc 


gggcaggcgg 


cagggatgtc 


ggcgaaggag 


360 


aggccaaagg 


gcaaagtgat 


caaggacagc 


gtcaccctcc 


tgccctgctt 


ttatttcgtc 


420 


gagttgccta 


tattggcatc 


atcggtggtt 


agcctctatt 


tcctcgaact 


cacagatgtc 


480 


ttcaaacctg 


tgcactctgg 


atttagctgc 


tatgaccgga 


gtcttagcat 


gccgtacatt 


540 


gaaccaaccc 


aggaggcaat 


tccattcctc 


atgttgctta 


gcttggcttt 


tgctggacct 


600 


gcaattacga 


ttatggtagg 


agaaggaatt 


ctctactgtt 


gcctctccaa 


aagaagaaat 


660 


ggggtcggac 


tagagcccaa 


cattaatgct 


ggaggctgca 


acttcaattc 


cttcctcaga 


720 


cgagctgtca 


gattcgttgg 


tgttcatgta 


tttggattat 


gctctacagc 


tctcattaca 


780 


gatatcatac 


agctgtccac 


aggatatcaa 


gcaccttact 


ttctgactgt 


gtgcaaacca 


840 


aactatacct 


ctctgaatgt 


atcttgcaaa 


gaaaattcct 


acattgtgga 


agatatttgc 


900 


tcaggatctg 


acctcacagt 


tatcaacagt 


ggcagaaagt 


ccttcccttc 


tcaacatgca 


960 


acccttgctg 


cctttgcagc 


tgtgtatgtt 


tcgatgtact 


tcaattccac 


attaacggat 


1020 


tcctctaagc 


ttctgaaacc 


tctcttggtc 


ttcacattta 


tcatctgtgg 


aataatctgc 


1080 


gggctaacac 


ggataactca 


gtataagaac 


cacccagttg 


atgtctattg 


tggcttttta 


1140 


ataggaggag 


gaattgcact 


gtacttgggc 


ttgtatgctg 


tggggaattt 


cctgcccagt 


1200 


gatgagagta 


tgtttcagca 


cagagacgcc 


ctcaggtctc 


tgacagacct 


caatcaagat 


1260 


cccaaccgac 


ttttatctgc 


taaaaatggt 


agcagcagtg 


atggaattgc 


tcatacagaa 


1320 


ggcatcctca 


accgaaacca 


cagagatgct 


agctctctga 


caaatctcaa 


aagagcaaat 


1380 


gctgatgtgg 


aaatcattac 


tccacggagc 


cccatgggga 


aggagaacat 


ggttaccttc 


1440 


agcaatacct 


tgccgcgagc 


caatacccca 


tctgtagaag 


accctgtcag 


aagaaatgcg 


1500 


agcattcatg 


cctctatgga 


ttccgctcga 


tcaaagcagc 


tcctcaccca 


gtggaagaat 


1560 


aagaatgaaa 


gtcgaaagtt 


gtccttgcaa 


gttatagagc 


ctgagcctgg 


gcagtcacca 


1620 


cccagatcca 


tagaaatgag 


gtcaagctca 


gagccatcga 


gggtaggggt 


gaatggagac 


1680 


caccatggtc 


ctggcaatca 


gtacctcaaa 


atccagcctg 


gcgctgtccc 


cggatgtaac 


1740 


aacagcatgc 


ctggagggcc 


aagagtgtcc 


attcagtccc 


gtcctgggtc 


ctcacagttg 


1800 


gtgcacatcc 


ctgaggagac 


tcaggaaaac 


ataagcacct 


cccccaaaag 


cagctctgct 


1860 


cgggccaagt 


ggttaaaagc 


tgctgaaaag 


actgtggcct 


gtaacagaag 


caacagccag 


1920 


ccccgaatca 


tgcaagtcat 


agccatgtcc 


aagcagcagg 


gtgtcctcca 


aagcagcccc 


1980 


aagaacactg 


aaggcagcac 


ggtctcctgc 


actggctcca 


tccgctataa 


aaccttgaca 


2040 


gaccatgagc 


ccagtgggat 


agtgagggtt 


gaggctcacc 


cagagaacaa 


caggcccatc 


2100 
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atacagatcc 


cgtccactga 


aggtgaaggc 


agtggctcct 


ggaagtggaa 


agcccctgaa 


2160 


aagggcagcc 


ttcgccaaac 


ttacgagctc 


aacgatctca 


acagggactc 


agaaagctgt 


2220 


gagtctctga 


aagacagctt 


tggttctgga 


gatcgcaaga 


gaagcaacat 


tgatagcaat 


2280 


gagcatcacc 


accacggaat 


taccaccatc 


cgcgtcaccc 


cagtagaggg 


cagcgaaatt 


2340 


ggctcagaga 


cgctgtccat 


ttcttcttcc 


cgcgactcca 


ccctgcggag 


aaagggcaat 


2400 


atcattctaa 


tccctgaaag 


aagcaacagc 


cccgaaaaca 


ctagaaatat 


cttctacaaa 


2460 


ggaacctccc 


ccacacgggc 


ttataaggat 


tgagtgatgt 


ccattccatc 


attaocracta 


2520 


ctcgcaaaag 


accatatgtt 


gattctacct 


gtgttctgtt 


ccagcgaatt 


QQcraaotctc 


2580 


accaagctag 


attgtctacc 


atcagcccag 


aactctgtaa 


cttttcagaa 


ct get a tact 


2640 


caaacttgca 


gatctcacat 


caaggagagg 


gaaaagcaca 


atgcaagaac 


ct aact a a eg 


2700 

£. 1 \J\J 


tgatgatatg 


aagagttttc 


ttaagacctg 


tcgtcaaact 


taaaaggttt 


tacaaaooac 

73 ^5 *H ^5 *H 


27 60 


agtatcaaaa 


gaaagtggtt 


ttcttcaaat 


gtatactatt 


ttacttcctg 


aatataccaa 


2820 


ctttggggat 


ttttctttat 


agtgagctgt 


gggaacccag 


aacacacacg 


ttttccctac 


2880 


agcagaggcc 


atgcagtatt 


atatattcat 


tttgcagaat 


ctgcacctac 


agctcaatac 


2940 


gggtggtgct 


gattattata 


gtacatatac 


catgtaaact 


ctcaaactct 


atttaactcrt 


3000 


gaaatagtgg 


tgtgcaattc 


cttgttaaag 


aaatgctact 


ttattaagaa 


gat gctggct 


3060 


gctttgtgtt 


agaataggac 


accccgcagc 


ttctctgtag 


tggctctgt c 


acagtcaaaa 


3120 


aatgaaaagg 


tttttgtgcg 


tttcttcaaa 


attctgcttt 


cttcaacatc 


aaaaattgtg 


3180 


tagaaatatt 


ttcagtgaaa 


gggaataact 


agtacttttc 


tgcatagttt 


ttcttctget 


3240 


tactttttat 


ttaagtatag 


gtactgctaa 


tgaatctgtt 


ttcttagtga 


crt aaattt or 


3300 

~J *J \J KJ 


ataattttat 


aaatattatt 


ttagagaatc 


ttttgaaatt 


gttgtgat ca 


tattttgett 


3360 


tctatggctt 


ctccttaact 


tattgattaa 


ttttttgaag 


ttatagatat 


gttctcct at 


3420 


tttaaaagca 


aaaataacaa 


ttgacattcc 


ttgagcaaaa 


tatactgctg 


t gaatttgea 


3480 


aacaagaaat 


ctgagccaaa 


acttgacatt 


gtgggttaca 


ttgccagaaa 


tgttggtcaa 


3540 


gtttgccctt 


agatgtctac 


aactagctgg 


cataggttgc 


catcttaaca 


agtaatctaa 


3600 


aagtcccatt 


cggttctaca 


ttattaactt 


tttttttcta 


tatcctgatg 


accagtaaat 


3660 


tagagccaca 


ctggttaagt 


ttgactcgtc 


tctaaaacgt 


ttttgttaat 


tggacaccaa 


3720 


gaggaagaat 


ctgaaaaaaa 


aatgcatgtt 


ggtaagtaaa 


agtatctcac 


ggtacaaatt 


3780 


aayaatydCL 




tatctgaata 


ggtgcagttt 


tagtttaaca 


tgcaaacaac 


3840 


cattgttgct 


acctatcctg 


aatcaagcct 


tgagcctaaa 


tcaaagcaaa 


ccaataccat 


3900 


tgataagaag 


aagataaaaa 


caaaatattt 


tggagtgttt 


tccaacttaa 


agtatgaaga 


3960 


catactcagt 


tcttggaact 


tagtattaaa 


ccttttttat 


gccatttcat 


aagaattccg 


4020 
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atatatactt 


gatgattgcc 


aaggggatga 


aaggaaacaa 


cagagatggt 


tgatctgatc 


4080 


ttagctcact 


ttccaataac 


agaaggagtt 


gtttacagat 


gaatagtatc 


acatcattat 


4140 


caatttccac 


atgaaaaagg 


tggagctttc 


tagaaaaacc 


aacctctaag 


gcattaggaa 


4200 


tttagctgaa 


accagcagaa 


ttgaaaactc 


tggcaataaa 


acatggactc 


aaccatatcc 


4260 


cttctggcaa 


tttccttctc 


agagagggga 


gtgggaataa 


aatgttgcct 


tccccacttc 


4320 


tcaccaccac 


cgccatcatg 


acgctcatac 


tggcttttgc 


ctgtttgtag 


aggaaaaggt 


4380 


gggctggttt 


tagtactctg 


aaggacaaaa 


acaagcaaac 


aaaaacccct 


gctgcagcat 


4440 


ttcaggtgca 


gtatgatatt 


tcctaatctt 


tcctatttct 


taacaaaaga 


ttttaaagta 


4500 


cttctctagt 


cattgaagtt 


tttttttctt 


tacataaata 


ttgatatatt 


ctttttctac 


4560 


tcaaagtgcc 


aaaggctaca 


gtttttaatg 


acttaacaaa 


ttgtaccaca 


ttgttaagga 


4620 


catataatga 


tagacactag 


aactcagacc 


tctgcatgta 


tatttgataa 


catgtctttt 


4680 


gtaaaacaaa 


aattacaaaa 


aaatttgttt 


acattccact 


ggtaccttaa 


tttaaaataa 


4740 


atcagactaa 


aaggtggtat 


ctcttcttag 


tgttctattt 


atcttatttg 


ctaatgggag 


4800 


cacttcttcc 


tttgttaggc 


tgtgctttac 


tgataaaacc 


aagtattgaa 


taaagagagt 


4860 


taattatctt 


tttaaagtaa 


ataaaattat 


gaaaatatat 


atagtatata 


taaagtcctg 


4920 


tgtttaaaaa 


aatgttatgc 


aatgttttcc 


aaactgataa 


agtttgtaaa 


gtgctataaa 


4980 


tgtattttgt 


taagtacaga 


taaaagctat 


tgtgtgagta 


tattgtgcta 


aaatcataga 


5040 


aataaagatt 


agatttcttc 


atcaaaaaaa 


aaaaaaaaaa 


aaa 




5083 
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aggagacgcg 


ctttgtgctg 


ggcgccggcc 


gcgccagcca 


cggcctgcgg 


cgcccgcggc 


60 


accatgatct 


ccaccaagga 


gaagaacaag 


atcccgaagg 


acagcatgac 


gcttctgccc 


120 


tgcttctact 


tcgtggagct 


gcccatagtg 


gcttcttcca 


tcgtatcctt 


gtacttcctg 


180 


gagctgaccg 


acctcttcaa 


gccggccaag 


gtgggcttcc 


agtgctatga 


ccgcactctc 


240 


tccatgccct 


acgtggagac 


caacgaggag 


ctcatcccgc 


tgctgatgct 


gctcagcttg 


300 


gccttcgcgg 


cccctgccgc 


ctcgatcatg 


gtggccgagg 


gcatgttgta 


ctgtctgcag 


360 


tcccggctgt 


ggggccgtgc 


cggggggccc 


gccggggcgg 


agggcagcat 


caacgccggc 


420 


ggctgcaact 


tcaactcctt 


cctgcggcgt 


acggtgcggt 


ttgtgggtgt 


ccacgtgttc 


480 


ggcctgtgtg 


ccacagccct 


ggtgacggac 


gtgatccagc 


tggccacggg 


ttaccacact 


540 


cccttcttcc 


tcaccgtctg 


caagcccaac 


tacactctcc 


tgggcacgtc 


ctgcgaggtc 


600 
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aacccctaca 
cggaagacct 
gtgagtccgg 
ctgaccccaa 
ctgctgaagc 
cagatcacgc 
ggcatcgctg 
aagcccgcgg 
gactcggttt 
ctggagggcg 
gccagcgtgg 
accttcagcc 
cacatgacca 
aagcagaaga 
ctgcgcgcgc 
gaggaggagg 
gtgcaggcgc 
ccgctggtgc 
ggcgccgggg 
aaccctccgc 
aaggcggccg 
ccgtcggacc 
gtggtgcacc 
aaggcggagg 
gccaagcccc 
cggttcgggg 
gcggccgatg 
ggcctggggc 
caggcgcgcc 
gggcgcgggc 



tcacgcagga 
tcccgtccca 
cacctcactg 
cccccatgcc 
ccatcctggt 
agtaccgcag 
cctacctggc 
ccccggcccc 
atcagcagaa 
cgccccggcc 
acgtggacct 
acacgctgcc 
tccacgtgcc 
gcctggaggg 
ccgccgaacc 
aggaagagga 
ggccggggct 
acatcccgga 
tgcgcgccaa 
ggctgctgca 
agacggcgtc 
gcgactccgc 
tgtcggccgg 
ccgacggcgg 
cgggcgtgtc 
ccgtggccac 
gggcgctggg 
tggcggagcg 
gcttccccga 
ggccgc 



catctgctcc 
gcacgccacg 
cccttcccag 
ccagatgtac 
cttcgccttt 
ccaccctgtg 
ctgccacgcg 
cgccaaggac 
taagtcggtg 
cgtggcccgc 
gctggccccg 
cagggccagc 
gctggacgcc 
ccgcggcctg 
catggcggag 
ggaggacgag 
ggggcctcgg 
ggagggcgcg 
gtggctcatg 
ggtcatcgcc 
gtcgtccagc 
cagcatcgtg 
cggcgcgccc 
ctacgagctg 
ccccggctcg 
cgtcaacctg 
cccgggcagc 
cgaggcggag 
ctagcgcggc 



ggccacgaca 
ctgtcagcct 
gccctcttgc 
ttcaactcgg 
gccatcgccg 
gacgtgtatg 
gtgggcaact 
gcgctgcggg 
agcaccgacg 
gagaagacct 
cgcagcccca 
gcgccctcgc 
tcgcgctcca 
gggctgcccg 
gaggaggaag 
ggcccggccc 
gtcatcctcc 
cagacggggg 
atggccgaga 
atgtccaagg 
gccagctccg 
accatcgacg 
tgggagtgga 
ggggacctgg 
tcggtcagcg 
gccacgggcg 
cgggagtcca 
gcggaggccg 
ggggccgggg 



tccacgccat 
tcgccgcggt 
tgacccgtgg 
tcatctcgga 
cgggcgtatg 
ccggcttcct 
tccaggcccc 
ccctgacgca 
agctggggcc 
cgctgggcag 
tggccaagga 
tggacgaccc 
agcagctcat 
acgacgccag 
aggaggagga 
cgccctcgct 
caccgcgcgc 
ccggcctgtc 
agagcggggc 
ctccgggcgc 
actcctcgca 
cgcacgcgcc 
aggcggcggg 
cgcgcggctt 
acgtggacca 

aggggctgcc 

cgctgcggcg 
agggctactt 
gcgggcgggg 



cctgtctgca 
ctatgtgtcg 
ggagccctcc 
caccaccaag 
cgggctcacg 
catcggggcg 
acctgcagag 
gcggggccac 
cccagggcgg 
cctgaagcgc 
gaacatggtg 
cgcgcgccgc 
cagcgagtgg 
ccccgggcac 
cgaagaggaa 
ctaccccacc 
ggggccgccg 
ccccaaaagc 
ggcagtggcc 
gccgggcccc 
gtaccggtcg 
gcaccacccc 
cggcggggcc 
ccgcggcggg 
ggaggagccg 
cccgctgggc 
ccacgcgggc 
ccgcaagatg 
ggcgggccga 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2356 
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<220> 

<221> misc_f eature 

<222> (460) . . (460) 

<223> n is a, c, g, or t 

<400> 8 



ygagcgatga 


tgccccattt 


accctttctc 


ttcagatgca 


ggaaattttc 


actctgttcc 


60 


ccagctgatt 


ggagcttttt 


ctaggtgctt 


ccctgggagt 


tacctcccta 


gagatcagca 


120 


ggcagggctg 


tcacgcttgg 


gtagcagcca 


gctcccagtg 


aattccttct 


gtggcctact 


180 


tgtccttatg 


aagtccgagt 


tttaattttg 


cacaggtagg 


aggtctcttt 


tgctatggat 


240 


agggcggata 


acggtgctac 


cattagaaaa 


caggcttctg 


ttttctagga 


aggcaagagg 


300 


aaccccaggt 


aggggacctt 


gtgagaccag 


gtgacttggc 


tcctcagcct 


tgcttctaca 


360 


gaaaccagga 


gtgcttcccc 


ccactcttcc 


ctatttttga 


cgtcaagctc 


aaccagccag 


420 


cagaggagcc 


tcacggcttg 


ggcggtggag 


agagagcccn 


aggagagtgg 


cagggagggg 


480 


aagccatctc 


agcaacagct 


tggagaggga 


gctgctatcc 


cttgcccgca 


aaacacggac 


540 


taaagccagg 


ctgaagaaga 


cctgcgggct 


cgggctcggg 


gatccgcggg 


gttactgcaa 


600 


agaagggcgg 


ggaaaaggcg 


ggggcgctgc 


atgcaacgcg 


ctggttccag 


cggtgcccgc 


660 


ggggaatgtg 


acatcagcgg 


cgccgggcgc 


ttgcggctgg 


agcaggcagc 


tcgcctcggt 


720 


ggccgcacgg 


tgcacacctc 


gcccggggga 


ggacttggag 


cccggcaggc 


ggccgggatg 


780 


tcggcgaagg 


agaggccaaa 


gggcaaagtg 


atcaaggaca 


gcgtcaccct 


cctgccctgt 


840 


ttttatttcg 


ttgagttgcc 


tatattggca 


tcatcagtgg 


ttagcctcta 


cttcttggaa 


900 


ctcacagatg 


tcttcaaacc 


tgtgcactct 


ggattcagtt 


gctatgatag 


gagtcttagc 


960 


atgccgtaca 


ttgagccaac 


ccaggaggcc 


ataccattcc 


ttatgttgct 


tagcttggct 


1020 


tttgctggac 


ctgcaattac 


gatcatggtg 


ggtgaaggga 


ttctatactg 


ctgcctctcc 


1080 


aaaagaagaa 


acggagctgg 


attggagcct 


aacatcaacg 


ccggaggctg 


caacttcaac 


1140 


tcctttctca 


ggagagccgt 


cagattcgtt 


ggtgtccatg 


tgtttggact 


gtgctccaca 


1200 


gctctcatta 


cagatatcat 


acagctctcc 


acaggatatc 


aggcaccata 


ctttctgact 


1260 


gtgtgcaagc 


caaactatac 


ctctctgaat 


gtatcctgca 


aagaaaactc 


ctacatcgtg 


1320 


gaagatattt 


gttcaggatc 


tgaccttaca 


gtcatcaaca 


gtggcagaaa 


gtcattccca 


1380 


tcccaacatg 


cgaccctcgc 


tgcctttgcc 


gctgtgtatg 


tgtccatgta 


cttcaattcc 


1440 


acattaaccg 


attcctctaa 


gctcctgaaa 


cctctcttgg 


tcttcacatt 


tatcatctgt 


1500 


gggatcatct 


gcggactaac 


acggataact 


caatataaga 


accatccagt 


cgatgtctat 


1560 
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tgtggctttt 


taataggagg aggaatcgca ctatatttgg gcctgtatgc 


tgtagggaat 


1620 


tttttgccta 


gtgaagacag 


tatgcttcag 


cacagagatg 


ccctcaggtc 


actgacagac 


1680 


ctcaatcaag 


accccagcag 


ggttttatca 


gctaaaaatg 


gtagcagtgg 


tgatggaatt 


1740 


gctcacacag 


agggtatcct 


caaccgaaac 


cacagggatg 


caagctcctt 


gacaaatctc 


1800 


aagagggcca 


acgctgacgt 


agaaatcatc 


actcctagga 


gccccatggg 


gaaggaaagc 


1860 


atggtgacct 


tcagcaacac 


gctgcccagg 


gccaacaccc 


cctccgtgga 


agacccagtg 


1920 


agaagaaatg 


cgagcatcca 


tgcctctatg 


gattctgccc 


ggtccaaaca 


gctccttacc 


1980 


cagtggaaga 


gcaagaatga 


gagtcgtaag 


atgtccctac 


aggttatgga 


cactgaacca 


2040 


gaaggccagt 


caccacccag 


gtccatagaa 


atgaggtcca 


gctcagagcc 


ctcgagggtg 


2100 


ggggtgaacg 


gagatcacca 


tgtccctggc 


aatcagtacc 


tcaagataca 


gcctggcaca 


2160 


gtccccgggt 


gcaacaatag 


tatgccggga 


gggccacgcg 


tgtccatcca 


gtcccgccct 


2220 


ggctcttccc 


aattggtgca 


catccccgag 


gagacccagg 


aaaacataag 


cacctcgccc 


2280 


aagagcagtt 


ctgcgcgagc 


caagtggctg 


aaagcagctg 


agaagaccgt 


ggactgtaac 


2340 


cggagcaaca 


accagccacg 


catcatgcag 


gtcatcgcca 


tgtccaagca 


gcagggcgtg 


2400 


ctgcagagca 


gccccaagaa 


tgccgaaggt 


agcactgtca 


cctgcacagg 


ttccatccgc 


2460 


tacaaaaccc 


tgactgacca 


tgagcccagc 


ggcatcgtgc 


gagtggaggc 


tcatcccgag 


2520 


aacaacaggc 


ccatcattca 


gatcccgtcg 


tccactgagg 


gtgaaggcag 


cggctcctgg 


2580 


aagtggaaag 


ttccggagaa 


aagtagtctg 


cgccaaacct 


atgagctcaa 


cgacctcaac 


2640 


agggactcag 


aaagctgtga 


gtccctcaaa 


gacagctttg 


gttctggaga 


tcgcaaaaga 


2700 


agcaacatcg 


acagcaatga 


gcaccaccac 


cacggcatca 


ccaccatccg 


agtgaccccg 


2760 


gtggagggca 


gcgagatagg 


ctcagagacg 


ctgtccgtgt 


cctcctcacg 


cgactccacc 


2820 


ctgcgcagga 


agggcaacat 


catcttgatc 


ccggaaagaa 


gcaacagccc 


tgaaaacaca 


2880 


agaaacatct 


tctacaaagg 


aacctccccc 


acgcgggctt 


ataaggattg 


agagatggcg 


2940 


gcccttcttg 


tcatcatttt 


gatgacaccc 


ccacctcccc 


atcccccacc 


ctcaccccaa 


3000 


gaccactcgt 


ttattgtacc 


ttgtgctctt 


ttgggttttt 


tgttttgttt 


tgtttggggg 


3060 


cctttttttt 


tccctagaag 


atatggagag 


ccttcttgtc 


caactagatt 


gttcaccatc 


3120 


agcctggaac 


tctcactgaa 


ccaccacaga 


aatcgtggcg 


attttacacc 


aagggaaagg 


3180 


aaaagcacaa 


agcaagaccc 


gaactaaact 


catcatcaga 


acacrt - 1* rt t- a 


agacacaggc 


3240 


tttgcagaag 


gtagtattaa 


gataaagtgg 


tttcctccga 


tgtatagtat 


ttaactttct 


3300 


gaatgtgcca 


acttaatgga 


gttttttttt 


ttcattataa 


ttagctgtgg 


gaacccaaaa 


3360 


cacataggtt 


ttcccaacag 


cagaggccat 


gcggtattat 


atattattca 


tttttgcaga 


3420 


ctctgcacca 


gaagagcaga 


ctgggtggtg 


ctgattatca 


cagtgcatct 


accatttaaa 


3480 
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ctctcaaact 


ctatgtagct 


gtgaaatagt 


ggtgtgcaac 


tcctcgtcag 


agaaatgcta 


3540 


cttcattcag 


aagacgccag 


tgactttgtg 


ttagaataga 


ccattcttgg 


cttccctgta 


3600 


gtggctctct 


cacagttgaa 


aagaaaagaa 


aagaaaaaga 


aaaagaaaga 


gagagagaga 


3660 


aagagagaaa 


gaaagaaaga 


aagaaagaaa 


gaaagaaaga 


aagaaagaaa 


gaaagaaaga 


3720 


aagaaagaaa 


gaaagaaaga 


aagaaagaat 


tggatgaatt 


ggacagggct 


ttgagcattt 


3780 


ctttgaaaga 


tgcttttttt 


caacatctga 


aagcttgtag 


gaatgttttc 


agtgaaacag 


3840 


aataactagt 


tctctgcatc 


gtttttcttc 


tttttattta 


agtattggta 


atgctgcttt 


3900 


ctggtttttt 


gttttttgtt 


ttagtgagtg 


catttgcata 


tttaaaatac 


attgttttag 


3960 


agaatatttt 


gaaattatta 


tgattacatt 


ttccatttta 


tggctttacc 


ttagtttatt 


4020 


aagttttctg 


agggttacac 


atattcttct 


attttaagaa 


agcaaaagtg 


acaacttgca 


4080 


ttctttgtgc 


aaaatacact 


gctgtgaggt 


cctacactag 


aaatctgagc 


caaaggttga 


4140 


aactgtgcgt 


gccaatgcca 


gatacgctgg 


tcaaggtcaa 


gatgtctcca 


atccgatggc 


4200 


ataggttatc 


acatcagtaa 


gtaatcccaa 


aatttcattt 


tgttccagag 


catttcattt 


4260 


tcatgttatc 


ttgataatca 


ccatattgga 


gccacagtgg 


gggtgagttt 


gactcccttt 


4320 


cctgacacac 


ttttaactgc 


acaccaacag 


taagaatcta 


ggcaaatgct 


aattgataaa 


4380 


tagatgtgta 


tcacagtata 


agtttagaaa 


gcatatcttc 


aaaatgtcag 


accaggtaaa 


4440 


gctttcgtgc 


ttagagtata 


accaacagtt 


ttggatgtct 


gtcttgaatc 


tagaacctta 


4500 


agcctaaatc 


aaaggaaacc 


ttactgttga 


tagcaagaag 


ataacaacat 


atttttgaag 


4560 


tggttttcca 


agctagctgt 


ttaaagtgtg 


gagaaggatt 


tggttcttga 


aatttggtat 


4620 


taaccttttt 


tcatgccatg 


tcttaagagt 


tataatgtac 


actcgatgat 


tgccaagaga 


4680 


gggggagggg 


gaggaaaaca 


gccaacagca 


gagctggttg 


gtctgaactc 


agtgcagttt 


4740 


tcaatgagaa 


caacagctgt 


ccagcaagga 


atcatatcat 


ccattctcag 


cttctacatt 


4800 


caaagggcag 


agctttttag 


aaaactcaac 


ctcctaaggc 


attaggaatt 


gagctgaaac 


4860 


cagcagaatt 


gaaaactctg 


gcaataaaat 


atagactcaa 


tcgtaaccct 


tctggcaagt 


4920 


tccttctcag 


agaaggaagt 


gggagtaaaa 


tgtggccttc 


cccacttctt 


tacatcaccc 


4980 


ctgtcacaat 


gtccccgctg 


gcctggccag 


tttcgagagg 


gaagggtgga 


ctggttttag 


5040 


tactctgaag 


aaaacccaag 


ctgcagtatt 


tgaggtgcag 


tataatattt 


cctaatcttt 


5100 


cctatttctt 


aacaaaaaaa 


gattttaaag 


tacttctcta 


ctcattgaat 


ttttgttctt 


5160 


tacatactat 


tgatatattc 


tttttctact 


caaaagtgcc 


aaaggctaca 


gtttttaatg 


5220 


acttaacaaa 


ttgtaccaca 


ttgttaagga 


aatataatga 


tagacactag 


aattcagacc 


5280 


tctgcatgta 


tatttgataa 


cacatctttt 


gtaaaaaata 


aataattaca 


aaaaatttgt 


5340 
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ttacattcca 


caggtacctt 


aatttaaaat 


aaatcagact 


aacaggtggt 


atctcttctt 


5400 


agtgttctat 


ttatcttatt 


tgctaatgag 


aacaattctt 


cttctgttag 


gctgtgcttt 


5460 


attgataaaa 


ccaagtattg 


aataaaggga 


gttaattatc 


tttttaaagt 


aaatgaaatt 


5520 


ataaatatat 


aatatatata 


aaotattota 


^- LaaLaaaa 


L LalyCdd 


ugttttccaa 


5580 


actgataaag 


tttgtaaagt 


gctataaatg 


tattttgtta 


agtacagatc 


aaagctatcg 


5640 


tgtgagtata 


ttgtgctaac 


atcatagaaa 


taaagattag 


atttcttcat 


caaaaaaaaa 


5700 


aaaaaaaa 
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<210> 9 

<211> 2783 

<212> DNA 

<213> Mus musculus 

<400> 9 

ggcgcggggc gcgggatctc gccttcccca cactgcgccg cgcgctccta ggtcccggag 60 

atcaaccgtg ctcgcgcaag gcgcacagaa ggcacgcgca aggggcaggg acgagaccca 120 

gattgcggtc cccgcgcagg gccagagacc ctgagacccg cagagtggga ggcggaggcg 180 

cccaagccgg gggcggggtt ggggcggggc cgagcgggct gcagagcgcg cagacaaaga 24 0 

gctcacgcgc cgcacaccgc gcccgggcct cgcgtccctc ccacgcgggg accctggctt 300 

tgtgtccgcg ctttgcaagg gactccgccc tccttgctcc ctccaggtga acgagccgcg 360 

tgcgcgcgcg ggggctccgg cctctccctt ccccaagttc cccctctgct ccagcctcag 4 20 

tttccccgag ccctcttggc taacccttat ctccccacct ggagactcca tctgcgcagc 4 80 

tcctagtctc cagcctgcct ttgtccccct gcaatctcgg cctacagcgc caacgtcacc 54 0 

atgcttgcta tgaaggagaa gaataaaacg ccgaaggaca gcatgacact cctgccttgc 600 

ttttatttcg tggagctacc catcgtggct tcctccattg tgtctctgta cttcctggag 660 

ctgactgacc tgttcaagcc ggccaaggtg ggctttcagt gctacgaccg cgccctgtcc 720 

atgccctatg tggagacaaa tgaagagctg atcccattgc tcatgctgct gagcctggcc 780 

ttcgctgcac ctgcggcctc aatcatggtc ggcgagggca tggtctactg tctgcagtcc 84 0 

aggctctggg gccgtggtcc agggggcgtg gagggcagca tcaacgctgg tggctgcaac 900 

ttcaactcct tcctccggcg cacagtgcgc tttgtgggtg t.acacgtgtt tggcctgtgt 960 

gccacggctc tggtgacaga tgtcatccag ctggccaccg gctaccacac acctttcttc 1020 

ttaacggtct gcaaacccaa ttacaccctg ctgggcactt cctgtgagtc gaacccttac 1080 

atcacacagg acatctgctc tggccacgat acccatgcca tcctgtcagc aaggaagacc 1140 

ttcccgtccc agcacgccac tctgtcagcc tttgctgctg tctacgtgtc gatgtacttc 1200 

aacgcggtta tctcggacac cacgaagctg ctgaagccca tccttgtgtt tgccttcgcc 1260 
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attgctgcgg 


gcgtctgcgg 


cctcacacag 


atcacccagt 


atcgaagcca 


ccctgtggac 


1320 


gtctacgctg 


gctttcttat 


cggtgctggc 


atcgctgcct 


acctggcctg 


ccacgctgtg 


1380 


ggcaacttcc 


aggcaccacc 


tgcagaaaag 


gttcctaccc 


cagctcctgc 


caaggacgcc 


1440 


ctgcgagcgc 


tgacacagcg 


gggccatgag 


tccatgtatc 


agcagaataa 


gtctgtcagt 


1500 


accgatgagc 


tgggccctcc 


agggaggcta 


gagggcgtgc 


ctcggcctgt 


ggctcgagag 


1560 


aagacatctc 


ttggcagctt 


gaagcgagcc 


agcgtggatg 


tggacctgct 


ggccccacgt 


1620 


agccccatgg 


gcaaagaagg 


catggtcacc 


ttcagcaaca 


cactgccccg ggtcagcacg 


1680 


ccctcgctgg 


atgaccctgc 


acggcgccac 


atgactatcc 


acgtgcccct 


tgatgcctcc 


1740 


cgttccaggc 


agcttatcgg 


tgagtggaag 


caaaaatccc 


tggagggacg 


tggcctgggt 


1800 


ctgcctgatg 


aagccagccc 


tgtgcatctg 


agggccccag 


cagagcaggt 


agcagaggag 


1860 


gaagaggaag 


aggaggagga 


ggaggaagaa 


gaggaggaag 


aggaggaaga 


ggaagggcct 


1920 


gttccaccct 


cactctaccc 


tactgtccag 


gctcggccag 


gccttgggcc 


ccgagtcatc 


1980 


ctgcctccaa 


ggccagggcc 


ccagcccctc 


gtacacatcc 


ctgaggaagg 


cgttcaggct 


2040 


ggagctggcc 


tgtcacccaa 


gagcagcagc 


tcatcagtga 


gggccaagtg 


gctgtcagtg 


2100 


gctgagaagg 


gtgggggccc 


agtggctgtg 


gctccatccc 


agccccgggt 


ggccaaccca 


2160 


cccaggctac 


tacaggtcat 


cgctatgtcc 


aaggcggcag 


ggggccccaa 


ggctgagaca 


2220 


gcttcgtcct 


ccagtgccag 


ctccgactct 


tcccagtaca 


ggtccccgtc 


agaccgtgac 


2280 


tctgccagca 


ttgttacaat 


cgatgcccat 


gcacctcacc 


atccagtggt 


gcacctgtct 


2340 


gctggcagca 


caccctggga 


gtggaaggct aaagtagtgg agggtgaggg tagctatgag 


2400 


ctgggggacc 


tggcacgcgg 


cttcagaagc 


agctgtaaac 


agcctggaat 


gggcccgggg 


2460 


tctccagtca 


gtgatgtgga 


ccaggaagaa 


ccccggtttg 


gggcagtggc 


cactgtcaac 


2520 


ctggccactg 


gggagggtct 


gcccccacca 


ggtgcaagtg 


aaggggccct 


aggtgcaggc 


2580 


agcagagaat 


ccaccctaag 


gcgccaggtg 


ggcgggctgg 


cagagagaga 


agtggaggct 


? fid o 


gaggcagaaa 


gttattatag 


gaggatgcag 


gccaggaggt 


accaggacta 


agcctggcaa 


2700 


aactgtgatg 


gggattaggg 


gagctggggg 


tgtctacagc 


cccaccaata 


aaaacttatg 


2760 


ctattaaaaa 


aaaaaaaaaa 


aaa 








2783 



<210> 10 

<211> 5372 

<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<222> (1137) . . (1137) 

<223> n is a, c, g, or t 



f 
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<220> 

<221> misc_f eature 

<222> (2899) . . (2899) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (2917) . . (2919) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (2921) . . (2924) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (2929) . . (2929) 

<223> n is a, c, g, or t 

<400> 10 



atgccccaat 


tcaccctctc 


tcttcagatg 


caggcaaaaa 


attctcactt 


tQtt CCt CLOC1 

w w V— * v» 


fin 


cggattggag 


ctttttctag 


gtgcttccct 


tggagttacc 


t ccctagaga 


t cacraacrnt" o 

w u a y \u \, 


1 9H 
± ^ u 


agggctgtca 


cgcttgggta 


gcggccagct 


cccagtgaat 


tccttctatg 


gectacttgt 


i ftn 

lOU 


cagtatgaaa 


tctgagtttt 


aattttgcac 


aggtggaggt 


ctcttttgct 




9 4 n 

£ H U 


tggatagcgg 


tgcccagcaa 


gctcctgctc 


tctagaaagg 


caacaagaac 




J V V 


agaccaggtg 


ccttggctcc 


tcagccttgc 


ttctgcagaa 


accaggacrtcr 


cct cccccca 


J uu 


ctatttttaa 


L. V — • a cl y W W 






l ca eager rg 


ggcggtggag 


420 


agcccaggga 


gagtggcagg 


gaggggaagc 


catcacagca 


acagcttgga 


gggggagctg 


480 


gctatcactt 


tccctcaaaa 


tacggactaa 


aacccggctg 


aagaagacct 


gcgggatcag 


540 


ggaagcgccg 


ggttactgca 


aagaagggcg 


gggaaaagga 


gggggcgctg 


catgcagcgc 


600 


gctggctcca 


gcggtgcccg 


cggggaatgt 


gacatcagcg 


gcaccgggcg 


ettgeggctg 


660 


gagcaggcag 


ctcgcctcgg 


tggccgcgca 


gtgcacacct 


cacccacggg 


aggactegga 


720 


gcccggcagg 


tggccgggat 


gtcggcgaag 


gagaggccaa 


agggcaaagt 


gatcaaggac 


780 


agcgtcaccc 


tcctgccctg 


tttctatttc 


gttgagttgc 


ctatattggc 


gtcatcggtg 


840 


gttagtctct 


acttcctgga 


actcacagat 


gtcttcaaac 


ctgtgcactc 


eggattcagt 


900 


tgctatgaca 


ggagtcttag 


catgccgtac 


attgagccaa 


cccaagaggc 


cattccattc 


960 


ctcatgttgc 


ttagcttggc 


ttttgctgga 


cctgcaatta 


cgatcatggt 


gggtgaaggg 


1020 


attctgtact 


gctgcctctc 


taaaagaaga 


aatggagctg 


gattggagee 


caacatcaac 


1080 


gccgggggct 


gcaacttcaa 


ttcctttctc 


agaagagccg 


tcagatttgt 


tggtgtncac 


1140 


gtggttggac 


tgtgctccac 


agctctcatt 


acggatatca 


tacagcttgc 


cacaggatat 


1200 


caggcaccat 


actttctgac 


tgtgtgcaag 


ccaatgtaca 


cctccctgga 


aggatcctgc 


1260 
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aaggaaaatt 


cctacatcgt 


ggaagaaatt 


tgttcgggat 


ctgacctaac 


agtcatcaac 


1320 


aatggcaaaa 


agtcattccc 


gtcccagcat 


gcgaccctcg 


ctgcctttgc 


tgctgtgtat 


1380 


gtgtccatgt 


acttcaattc 


cacattaact 


gactcctcca 


agctcctgaa 


acctctcttg 


1440 


gtcttcacat 


ttatcatctg 


tggaatcatc 


tgcgggctaa 


cacggatcac 


tcagtataag 


1500 


aaccatcccg 


tcgatgttta 


ttgtggcttt 


ttaattggag 


gaggaatcgc 


gctgtatttg 


1560 


ggcctgtatg 


ctgtagggaa 


ttttctgcct 


agtgaagaca 


gcatgcttca 


gcacagagat 


1620 


gccctcaggt 


cactgacaga 


cctcaatcaa 


gaccccagca 


gggttttgtc 


agctaaaaat 


1680 


ggtagcagcg 


gtgatggaat 


tgctcacaca 


gagggtattc 


tcaaccgaaa 


tcacagggat 


1740 


gcaagctcct 


taacaaatct 


gaagagggcc 


aacgctgacg 


tagagatcat 


cactcctagg 


1800 


agccccatgg 


ggaaggagag 


catggtgacc 


ttcagtaaca 


cgctgcccag 


ggccaacacc 


1860 


ccgtccgtgg 


aagacccggt 


gagaagaaac 


gcgagcattc 


acgcctctat 


ggattctgcc 


1920 


cggtccaagc 


agctccttac 


ccagtggaag 


agcaagaacg 


agagtcgcaa 


gatgtccctg 


1980 


caggttatgg 


actctgaacc 


agaagggcag 


tcaccaccca 


ggtccataga 


aatgaggtcc 


2040 


agctcagagc 


cctcgagggt 


gggggtgaat 


ggagatcacc 


atgtcccggg 


caatcagtac 


2100 


ctcaagatcc 


agcctggcac 


agttcccggg 


tgcaacaata 


gtatgcctgc 


agggccacgt 


2160 


gtgtccatcc 


agtcccgccc 


tggctcttcc 


caattggtgc 


acatccccga 


ggagacccag 


2220 


gaaaacataa 


gcacctcgcc 


caagagcagt 


tccgcacggg 


ccaagtggct 


gaaagcagca 


2280 


gagaagaccg 


tggcctgtaa 


ccggggcaac 


aaccagccac 


gcatcatgca 


ggtcatcgcc 


2340 


atgtccaagc 


agcagggcgt 


gctgcagagc 


agccccaaga 


atgccgaagg 


tagcacggtc 


2400 


acctgcacag 


gctccatccg 


ctacaaaacc 


ctgactgacc 


acgagccgag 


tggcattgtg 


2460 


cgagtggagg 


ctcatcccga 


gaacaacagg 


cccatcatcc 


agattccatc 


gtccaccgag 


2520 


ggcgaaggca 


gtggctcctg 


gaagtggaaa 


gctccggaga 


agagtagcct 


gcgccaaacc 


2580 


tatgagctca 


acgacctcaa 


cagggactcg 


gaaagctgcg 


agtccctcaa 


agacagcttt 


2640 


ggttctggag 


atcgcaagag 


aaagcacatc 


gacagcaatg 


agcaccacca 


ccacggcatc 


2700 


acaaccatcc 


gcgtgacccc 


agtggagggc 


agcgagatag 


gctcagagac 


gctgtctgta 


2760 


tcctcctcac 


gcgactccac 


cctgcgcagg 


aagggcaaca 


ttatcttgat 


ccctgaaaga 


2820 


agcaacagcc 


ctgaaaacac 


aagaaacatc 


ttctacaaag 


gaacctcccc 


cacgaggcct 


2880 


tataaggatt 


gaatccccnc 


ccttctgcct 


catcatnnng 


nnnncatcnc 


ccaccccccc 


2940 


accccagcca 


cacatttatt 


atagtacctt 


gtgctctttt 


gggtattttg 


ttgttgtttt 


3000 


ggttttggtt 


ttttgttttc 


ttttctagtg 


gatatggaga 


gccttcttgc 


ccaactagat 


3060 


tgttcaccat 


cagcctggaa 


ctctcactga 


cccaccacag 


aaaccgtggt 


gattttccac 


3120 



28 



caagggaaag 


gaaaagcaca 


aagcaagacc 


cgaactaatc tcatcatctg aatagttctt 


J-lOU 


aaggcctgtc 


actgagactt 


cgcagagggc 


agtattagga 


taaagtggtc 


tcctccgatg 




tatagtattt 


aactttctga 


atgtgccaac 


tttgaagttt 


cttttttttt 


tttttcatta 




taattagcgg 


tgggaaccca 


aaacactaca 


ggtttccccg 


gcagcagagg 


ccacgcagta 


jjDU 


ttatatagta 


tttatttttg 


cagactctgc 


accagaagag 


cagactgggc 


ggtgctgatt 


ia 9n 

O H £. U 


accacagtac 


atctaccatt 


tcaactctca 


gactctgtgt 


agctgtgaga 


gagtggtgta 




caaccccctt 


gtcagagaaa 


tgctacttca 


ttcagaagac 


gccagtgact 


ttgtggtagc 




ataggccatt 


cttggcttcc 


ctqtaqtqqc 


tctctcacag 


tcaaaaagaa 


aaaagaaaga 


obUU 


aaaaactgga 


cacraattcta 


agcatttctt 


tgaaagattc 


tttgttttca 


acatttgaaa 


3660 


qtctataaaa 


at gt ttt egg 


tgagacggaa 


caactagttc 


tccgcatcgt 


ttttctgctt 


3720 


tttatttaag 


tattggtaat 


gctggtttct 


ggtatctttt 


ttttttagtg 


agtgcatttg 


J/80 


catatttaaa 


atacattgtt 


ttagagaata 


ttttgaaatt 


attataatca 


cattttccat 


J840 


tttgtggctt 


catcttagtt 


tattaagttt 


tttgaggtta 


tgcacaccat 


tcttctattt 


3900 


taagaaagca 


acagtgacca 


tttacattcc 


ttgtgcaaaa 


tacactgctt 


cgaggtccta 




cgcaagaaat 


atgagccaac 


ggtt gaaact 


gtgcgttcca 


ttgccagata 


cattggtcac 


A A o r\ 

4 U20 


attcaagatg 


tctccaatcc 


qatqqcataq 


gttatcacat 


cagtaagcga 


tccccaaact 


4 080 


tcattttctt 


ccagagcatt 


tcatttttgt 


gttatcctga 


taatcaccat 


atcggagcca 


A 1 A Ck 
H 1 4 U 


ccatgggggt 


gagcttgact 


cccttccctg 


acacactttt 


aactggacac 


caacagtaag 




gacctaggca 


aaggctaatt 


gatagataga 


agtgtatcac 


agcatacatt 


tagaaagtgt 


H z bu 


atcttcaaaa 


cagcagacca 


ggcaaagctt 


ttgtgcttag agtactattg acagttttgg 


4 oZQ 


ctgcctgtct 


tgaatctaga 


accttgagcc 


aaaatcaaag 


gaaaccatac 


tgttgatagc 




aagaagatta 


ccacaagata 


ttttt gaagt 


gttttcccag 


gttagatgtt 


taaagagtgg 


4 4 4 0 


agacatatat 


t tggtccttg 


aaatttgtta 


taaaaccttt 


ctcatgccat 


gtcttaagaa 


4 500 


ttataatgga 


cacttgatga 


ttgccaagag 


ggggagcaaa 


aacagcgaac 


agcagagctg 


4 o d0 


qttqqtctqa 


act cagt gca 


gctt t cagtg 


agaacaaaag 


ctgtccagca 


aagagtcaca 


4 620 


tcattcattc 


tctagcctct 


ccattcaaag 


ggcagagctt 


tttttttttt 


ttaagaaaac 


A £ o r\ 

4 680 


tcaacctctt 


aaggcattag 


gaatttagct 


gagaccagca 


gaattgaaaa 


ctctggcagt 


H I 4 U 


aaaatatagg 


ctcaaccata 


acccttctgg 


aaagttcctt 


tttttttttt 


ttttcctttt 


4 o UU 


ttutatt ttt 


tttttttttt 


ttgtgatcga 


gttgactgca 


gtagttcatc 


atattctttt 


4860 


tctactcaaa 


gtgccaaagg 


ctacagtttt 


taaacgactt 


aacaaattgt 


accacactgt 


4920 


taaggaacta 


taacgataga 


cactagaatt 


cagacctctg 


catgtatatt 


tgataacaca 


4980 


tcttttgtaa 


aaaataaata 


attacaaaaa 


atttgtttac 


attccactgg 


taccttaatt 


5040 
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taaaagaaat cagactaaca ggtggtatct cttcttagtg ttctatttat cttacttgct 5100 

aacgagagca attcttcttt tgttaggctg tgctttattg atgaaaccga gtattgaata 5160 

aagagagtta attatctttt taaagtaaat aaaactatga atatataata tatatgaagt 5220 

atcgtgttta ataaaaatgt tatgcaatgt tttccaaact gataaagttt gtaaagtgct 5280 

ataaatgtat tttgttaagt acagatcaaa gctatcgtgt gagtatattg tgctggcatc 534 0 

atagaaataa agattagatt tcttcatcaa aa 5372 

<210> 11 
<211> 30 
<212> DNA 

<213> Rattus norvegicus 
<400> 11 

caccacagct gagagggaaa tcgtgcgtga 30 

<210> 12 
<211> 30 
<212> DNA 

<213> Rattus norvegicus 
<400> 12 

atttgcggtg cagcatggag gggccggact 30 

<210> 13 
<211> 51 
<212> DNA 

<213> Rattus norvegicus 
<400> 13 

gcagaggtct gaattctagt gtctatcgtt atagttcctt aacagtgtgg g 51 

<210> 14 

<211> 14 

<212> PRT 

<213> Artificial 

<220> 

<223> Peptide representing amino acids 4 90 to 507 of Rattus norvegicus 
PRG-1 protein with an added amino terminal cysteine 

<400> 14 

Cys Val Gly Val Asn Gly Asp His His Val Pro Gly Asn Gin 
15 10 

<210> 15 
<211> 26 
<212> DNA 

<213> Rattus norvegicus 
<400> 15 

ctaggcttgt agctgtgggg aatttc 26 



<210> 16 
<211> 22 
<212> DNA 

<213> Rattus norvegicus 
<400> 16 

tcaatcctta taagcccgtg tg 



<210> 17 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 17 

gaactttgcg agtgagctgg 



<210> 18 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 18 

tgcggagagc tttaacctcc 



<210> 19 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 19 

cctacctctt cctcatgttc 



<210> 20 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 20 

taaagggtgg agtccatcag 



<210> 21 
<211> 21 
<212> DNA 

<213> Rattus norvegicus 
<400> 21 

ggaattgcct ctgcaacatc t 



<210> 22 
<211> 18 
<212> DNA 

<213> Rattus norvegicus 
<400> 22 

gagtagatga tgggttca 
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Claims ^% C/ > 

1. Isolated protein comprising the same or substantially the same amino acid sequence 
selected from the group consisting of SEQ ID NOs: 1 to 5, or a splice variant or a salt 
thereof 



2. Protein according to claim 1, which comprises at least one fragment of said amino acid 
sequences. 

10 3. Nucleic acid, which comprises at least one nucleic acid encoding a protein according 
to claims 1 and/or 2. 

4. Nucleic acid according to claim 3, which consists of DNA or RNA. 

15 5. Nucleic acid according to claim 4 comprising a nucleic acid selected from the group 
consisting of SEQ ID NOs: 6 to 10. 

6. Nucleic acid according to one of claims 3 to 5 further comprising at least one pro- 
moter, enhancer, intron and/or polyA-sequence. 

20 

7. Nucleic acid, which is complementary to the nucleic acid according to one of claims 3 
to 6. 

8. Vector comprising a protein according to claims 1 or 2 and/or a nucleic acid according 
25 to one of claims 3 to 7. 

9. Vector according to claim 8 selected from the group of vectors consisting of plasmids, 
phagemids, phages, cosmids, artificial mammalian chromosomes, knock-out or knock- 
in constructs, viruses, in particular adenovirus, vaccinia virus, baculovirus, retrovirus, 

30 adeno-associated-virus, rhinovirus, HIV, adeno-associated virus (AAV), herpes sim- 

plex virus (HSV-1), lentivirus, filovirus and engineered versions thereof, naked DNA, 
virosomes, liposomes, nucleic acid coated particles, in particular gold spheres. 



Isolated cell comprising a protein according to claims 1 or 2, a nucleic acid according 
to claims 3 to 7 and/or a vector according to claims 8 or 9. 

Cell according to claim 10, which is a stem cell, a neuronal precursor cell or a neu- 
ronal cell, in particular an axon. 

Transgenic non-human animal generated from a cell according to claims 10 or 1 1. 
Antibody directed against a protein according to claims 1 or 2. 

Method of producing a protein according to claims 1 or 2 and/or a nucleic acid ac- 
cording to claims 3 to 7 comprising the steps of: 

a) cultivating a cell according to claims 1 0 or 1 1 , and 

b) isolating said protein and/or said nucleic acid. 

Method of isolating compounds interacting with a protein according to claims 1 or 2 
comprising the steps of: 

a) contacting said protein with at least on potentially interacting compound, 

b) measuring binding of said compound to said protein. 

Method according to claim 16, further comprising the steps of: 

a) selecting a binding compound, 

b) modifying the binding compound, to generate a variety of modified binding com- 
pounds, 

c) contacting said protein with each of the modified binding compounds, 

d) measuring binding of said modified compounds to said protein, and 

e) if needed repeating steps a) to d) for one or more times. 

Method of isolating functional interactors comprising the steps of: 



a) contacting a neuronal cell that comprises a wt nucleic acid coding for a protein 
selected from the group consisting of SEQ ID NOs: 1 to 5, a splice variant 
thereof, or a fragment thereof with a potential functional interactor, 

b) contacting the cell with a bioactive lipid phosphate, and 

c) measuring neurite movement. 

Method according to claim 17 further comprising the steps of: 

a) contacting a neuronal cell that comprises a mutant nucleic acid coding for a mu- 
tant of the protein selected from the group consisting of SEQ ID NOs: 1 to 5, or a 
splice variant thereof or that contains a knock-out of the wt nucleic acid coding 
for one of said proteins with a potential functional interactor, 

b) contacting said cell with a bioactive lipid phosphate, and 

c) measuring neurite movement. 

Method according to claims 17 or 18 further comprising the steps of: 

a) selecting a functional interactor, 

b) modifying the functional interactor, to generate a variety of modified functional 
interactors, 

c) contacting a neuronal cell as used in claim 17 a) and if needed a cell as used in 
claim 18 a) with each of the modified functional interactors, 

d) contacting said cell or cells with a bioactive lipid phosphate, 

e) measuring neurite movement, and 

f) if needed repeating steps a) to d) for one or more times. 

Method according to one of claims 15 to 19, further comprising the step of admixing 
the interacting compound or the functional interactor with suitable auxiliary sub- 
stances and/or additives. 

Pharmaceutical composition for the treatment of neuronal injuries or diseases com- 
prising a protein according to claims 1 or 2, a nucleic acid according to claims 3 to 7, a 
vector according to claims 8 to 9, a cell according to claims 10 to 1 1, an antibody ac- 
cording to claims 13, a binding compound isolated by the method of claims 15 or 16 



-4- 



and/or a functional interactor isolated by the method of claims 17 to 19 and if needed 
suitable auxiliary substances and/or additives. 

22. Use of a pharmaceutical composition of claim 21 for the production of a medicament 
5 for the treatment of neuronal diseases or injuries. 

23. Use of a protein according to claim 1 or 2 or a nucleic acid according to claims 3 to 7 
as a diagnostic marker for the diagnosis of a disease or disease state. 



10 24. 



Use according to claim 23, where the disease is a neuronal disease, a tumor disease or 
infertility. 
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Summary 

Lipid phosphate phosphatase proteins, genes coding for them, vectors and cells comprising 
them, antibodies directed against them, methods of identifying compounds binding to them 
5 and functional interactors as well as to the use of proteins, genes, vectors, cells, interacting 
compounds and functional interactors for treating neuronal diseases and/or injuries. 
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Figure 2 
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